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Abstract 

We study several stochastic combinatorial problems, including the expected utility maxi- 
mization problem, the stochastic knapsack problem and the stochastic bin packing problem. 
A common technical challenge in these problems is to optimize some function (other than the 
expectation) of the sum of a set of random variables. The difficulty is mainly due to the fact 
^vq . that the probability distribution of the sum is the convolution of a set of distributions, which 

is not an easy objective function to work with. To tackle this difficulty, we introduce the Pois- 
Q ■ son approximation technique. The technique is based on the Poisson approximation theorem 

discovered by Le Cam, which enables us to approximate the distribution of the sum of a set of 
random variables using a compound Poisson distribution. Using the technique, we can reduce a 
variety of stochastic problems to the corresponding deterministic multiple-objective problems, 
which either can be solved by standard dynamic programming or have known solutions in the 
literature. For the problems mentioned above, we obtain the following results: 

1. We first study the expected utility maximization problem introduced recently [Li and 
Despande, FOCS11]. For monotone and Lipschitz utility functions, we obtain an additive 
PTAS if there is a multidimensional PTAS for the multi-objective version of the problem, 
strictly generalizing the previous result. The result implies the first additive PTAS for 

■ maximizing threshold probability for the stochastic versions of global min-cut, matroid 

, base and matroid intersection. 

2. For the stochastic bin packing problem (introduced in [Kleinberg, Rabani and Tardos, 
STOC97]), we show there is a polynomial time algorithm which uses at most the optimal 
number of bins, if we relax the bin size and the overflow probability by e for any constant 
e > 0. Based on this result, we obtain a 3-approximation if only the bin size can be relaxed 
by e, improving the known 0(1/ e) factor for constant overflow probability. 

3. For the stochastic knapsack problem, we show a (f + e)-approximation using e extra ca- 
pacity for any e > 0, even when the size and reward of each item may be correlated and 
cancelations of items are allowed. This generalizes the previous work [Balghat, Goel and 

i—t I Khanna, SODAf 1] for the case without correlation and cancelation. Our algorithm is also 

simpler. We also present a factor 2 + e approximation algorithm for stochastic knapsack 
with cancelations, for any constant e > 0, improving the current known approximation 
factor of 8 [Gupta, Krishnaswamy, Molinaro and Ravi, FOCSlf]. 

4. We also study an interesting variant of the stochas- tic knapsack problem, where the size 
and the profit of each item are revealed before the decision is made. The problem falls 
into the framework of Bayesian on- line selection problems, which has been studied a lot 
recently. We obtain in polynomial time a (f + e)-approximate policy using e extra capacity 
for any constant e > 0. 

Lastly, we remark that the Poisson approximation technique is quite easy to apply and may find 
other applications in stochastic combinatorial optimization. 
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1 Introduction 



We study several stochastic combinatorial optimization problems, including the threshold prob- 
ability maximization problem [52} 1501 145] . the expected utility maximization problem [45 j . the 
stochastic knapsack problem [24|, [T3"l [35] , the stochastic bin packing problem |41[ [3Tj and some of 
their variants. All of these problems are known to be #P-hard and we are interested in obtaining 
approximation algorithms with provable performance guarantees. We observe a common technical 
challenge in solving these problems, that is, roughly speaking, given a set of random variables with 
possibly different probability distributions, to find a subset of random variables such that certain 
functional (other than the expectation ) of their sum is optimized. The difficulty is mainly due to 
the fact that the probability distribution of the sum is the convolution of the distributions of indi- 
vidual random variables. To address this issue, a number of techniques have been proposed (briefly 
reviewed in the related work section). In this paper, we introduce a new technique, called the 
Poisson approximation technique, which can be used to approximate the probability distribution 
of a sum of several random variables. The technique is very easy to use and yields better or more 
general results than the previous techniques for a variety of stochastic combinatorial optimization 
problems mentioned above. In the rest of the section, we formally introduce these problems and 
state our results. 

Terminology: We first set up some notations and review some standard terminologies. Following 
the literature, the exact version of a problem 21 (denoted as Exact-2l) asks for a feasible solution 
of 21 with weight exactly equal to a given number K. An algorithm runs in pseudopolynomial time 
for Exact- 21 if the running time of the algorithm is bounded by a polynomial of n and K. 

A polynomial time approximation scheme (PTAS) is an algorithm which takes an instance of a 
maximization problem and a parameter e and produces a solution whose cost is at least (1 — e)OPT, 
and the running time, for any fixed e, is polynomial in the size of the input. If e appears as an 
additive factor in the above definition, namely the cost of the solution is at least OPT — e, we 
say the algorithm is an additive PTAS. We say a PTAS is a fully polynomial time approximation 
scheme (FPTAS) if the running time is polynomial in the size of the input and -. 

In a multidimensional minimization problem, each element b is associated with a weight vector 
(wb,i, Wb,2, ■ ■ •)• We are also given a budget vector (B\,B2, ■ ■ ■)■ The goal is to find a feasible solution 
5 € J 7 such that Ylbes w b,i — -BjVi. We use Multi-2l to denote the problem if the corresponding 
single dimensional optimization problem is 21. A multidimensional PTAS for Multi is an algorithm 
which either returns a feasible solution S € T such that Y2beS w b,i < (1 + e)BjVi, or asserts that 
there is no feasible solution S' with YlbeS' w b,i — &i 

1.1 Expected Utility Maximization 

We first consider the fixed set model of a class of stochastic optimization problems introduced in 
|45j . We are given a ground set of elements (or items) B = {fej}i=i...n- Each feasible solution to the 
problem is a subset of the elements satisfying some property. In the deterministic version of the 
problem, we want to find a feasible solution S with the minimum total weight. Many combinatorial 
problems such as shortest path, minimum spanning tree, and minimum weight matching belong to 
this class. In the stochastic version, each element b is associated with a random weight Xb- We 

1 We can use the linearity of expectation to circumvent the difficulty of convolution. 
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assume all XjyS are discrete nonnegative random variables and are independent of each other. We 
are also given a utility function /x : R — > M + to capture different risk-averse or risk-prone behaviors 
that are commonly observed in decision-making under uncertainty. Our goal is to to find a feasible 
set S such that the expected utility K[n(X(S))] is maximized, where X(S) = Ylbes-^b- We refer 
to this problem as the expected utility maximization (EUM) problem. An important special case is 
to find a feasible set S such that Pi[X(S) < 1] is maximized, which we call the threshold probability 
maximization (TPM) problem. Note that if fJ.(x) = 1 for x € [0, 1]; n(x) = for x > 1, we have 
that Pr[X(S) < 1] = E[/j,(X(S))]. In fact, this special case has been studied extensive in literature 
for various combinatorial problems including stochastic versions of shortest path [52], minimum 
spanning tree [BHJ [30] , knapsack [31] as well as some other problems [H [50] . We use 21 to denote the 
deterministic version of the optimization problem under consideration, and use accordingly EUM-21 
and TPM-21 to denote the expected utility maximization problem and the threshold probability 
maximization problem for 21 respectively. 

Our Results: Following the previous work [45j . we assume lrnxj^oo fi(x) = (if the weight of 
our solution is too large, it is almost useless). We also assume fj, is a-Lipschitz in [0, oo), i.e., 
— < a\x — y\ for any x, y G [0, oo), where a is a positive constant. Our first result is an 
alternative proof for the main result in |45| . 

Theorem 1.1. Assume there is a pseudopolynomial time algorithm for Exact-'Qi. For any e > 0, 
there is a poly-time approximation algorithm for EUM-21 that finds a feasible solution S such that 

E[fi(X(S))} > OPT-e. 

For many combinatorial problems, including shortest path, spanning tree, matching and knap- 
sack, a pseudopolynomial algorithm for the exact version is known. Therefore, Theorem 11.11 im- 
mediately implies an additive PTAS for the EUM version of each of these problems. An important 
corollary of the above theorem is a relaxed additive PTAS for TPM: For any e > 0, we can find in 
polynomial time a feasible solution S such that 

Pt[X(S) < 1 + e] > OPT - e , 

provided that there is a pseudopolynomial time algorithm for Exact-2l. In fact, the corollary follows 
easily by considering the monotone utility function x(x) = 1 for x € [0,1]; xi x ) = — f + \ + 
1 for x G [1, 1 + e]; x( x ) = for x > 1 + e, which is ^-Lipschitz. We refer the interested reader 
to [15] for more implications of Theorem 11.11 However, this is not the end of story. Our second 
major result considers EUM with monotone nonincreasing utility functions, a natural class of utility 
functions (we denoted the problem as EUM-Mono). We can get the following strictly more general 
result for EUM-Mono. 

Theorem 1.2. Assume there is a multidimensional PTAS for Multi-%1. For any e > 0, there is a 
poly-time approximation algorithm for EUM-Mono-%1 that finds a feasible solution S such that 

E[fj,(X(S))] > OPT-e. 

It is worthwhile mentioning that condition of Theorem 11.21 is strictly more general than the 
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condition of Theorem I l.li It is known that if there is pseudopolynomial time algorithm for Exact- 
21, there is a multidimensional PTAS for Multi-2l, by Papadimitriou and Yannakakis [53]. However, 
the converse is not true. Consider the minimum cut (MC) problem. A pseudopolynomial time 
algorithm for Exact-MC would imply a polynomial time algorithm for the NP-hard MAX-CUT 
problem, while a multidimensional PTAS for Multi-MC is known [3]. Therefore, Theorem 11.21 implies 
the first relaxed additive PTAS for TPM-MC. Other problems that can justify the superiority of 
Theorem 11.21 include the matroid base (MB) problem and the matroid intersection (Ml) problem. 
Obtaining pseudopolynomial time exact algorithms for Exact-MB and Exact-MI is still open |14^l . 




while multidimensional PTASes for Multi-MB and Multi-MI are known [541 119]. 

We would like to remark that obtaining an additive PTAS for EUM-21 for non-monotone util- 
ity functions under the same condition as Theorem 11.21 is impossible. Consider again EUM-MC. 
Suppose the weights are deterministic. The given utility function is fi(x) = WOx — 89 for x € 
[0.89,0.9]; n(x) = -WOx + 91 for x € [0.9,0.91]; fi(x) = otherwise, (which is 100-Lipschitz) 
and the maximum cut of the given instance has a weight 0.9. So, the optimal utility is 1, but 
obtaining a utility value better than is equivalent to finding a cut of weight at least 0.89, which 
is impossible given the imapproximability result for MAX-CUT [40] . 

Our techniques: Our algorithm consists of two major steps, discretization and enumeration. Our 
discretization is similar to, yet much simpler than, the one developed by Bhalgat et al. [13] . In 
their work, they developed a technique which can discretize all (size) probability distributions into 
O(logra) equivalent classes. This is a difficult task and their technique applies several clever tricks 
and is quite involved. However, we only need to discretize the distributions so that the size of 
the support of each distribution is a constant, which is sufficient for the enumeration step. In the 
enumeration step, we distinguish the items with large expected weights (heavy items) and those 
with small expected weights (light items). We argue that there are very few heavy items in the 
optimal solution, so we can afford to enumerate all possibilities. To deal with light items, we invoke 
Le Cam's Poisson approximation theorem which (roughly) states that the distribution of the total 
size of the set of light items can be approximated by a compound Poisson distribution, which can be 
specified by the sum of the (discretized) distribution vectors of the light items (called the signature 
of the set). Therefore, instead of enumerating 0(2 n ) combinations of light items, we only need 
to enumerate all possible signatures and check whether there is a set of light items with the sum 
of their distribution vectors approximately equal to (or at most) the signature. To solve the later 
task, we need the pseudopolynomial time algorithm for Exact- 21 (or the multidimensional PTAS for 
Multi-2l). 

1.2 Stochastic Bin Packing 

In the stochastic bin packing (SBP) problem, we are given a set of items IB = {bi, 62, • • • , b n } and 
an overflow probability < p < 1. The size of each item b{ is an independent random variable Xi 
following a known discrete distribution. The distributions for different items may be different. Each 
bin has a capacity of C. The goal is to pack all the items in B using as few bins as possible such 
that the overflow probability for each bin is at most p. The problem was first studied by Kleinberg, 

2 Pseudopolynomial time algorithms are known only for some special cases, such as spanning trees matroids 
with parity conditions [14] . 
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Rabani and Tardos [4"T] . They obtained a 0(l/e)-approximation, for only Bernoulli distributed 
items, if we relax the bin size to (1 + e)C or the overflow probability to p l ~ e . They also obtained 
a O ( y^^j^pr ) -approximation without relaxing the bin size and the overflow probability. Goel 
and Indyk [31] obtained PTAS for both Poisson and exponential distributions and QPTAS (i.e., 
quasi-polynomial time) for Bernoulli distribution. 

Our Results: Our main result for SBP is the following theorem. 

Theorem 1.3. For any fixed constant e > 0, there is a polynomial time algorithm for SBP that 
uses at most the optimal number of bins, when the bin size is relaxed to (1 + e)C and the overflow 
probability is relaxed to p + e. 

To the best of our knowledge, our result is the first result for SBP for arbitrary discrete dis- 
tributions. Based on this result, we can get the following result when the overflow probability is 
not relaxed. For Bernoulli distributions, this improves the 0(l/e)-approximation in [51] for any 
constant p. 

Theorem 1.4. For any constant p > 0, we can find in polynomial time a packing that uses at most 
30PT bins of capacity (1 + e)C such that the overflow probability of each bin is at most p. 

Our technique: Our algorithm for SBP is similar to that for EUM. We distinguish the heavy items 
and the light items and use the Poisson approximation technique to deal with the light items. One 
key difference from EUM is that we have a linear number of bins, each of them may hold a constant 
number of heavy items. Therefore, we can not simply enumerate all configurations of the heavy 
items since there are exponential many of them. To reduce the number of the configurations to a 
polynomial, we classify the heavy items into a constant number of types (again, by discretization). 
For a fixed configuration of the heavy items, using the Poisson approximation, we reduce SBP to a 
multidimensional version of the multi-processors scheduling problem, called the vector scheduling 
problem, for which a PTAS is known |17j . 

1.3 Stochastic Knapsack 

The deterministic knapsack problem is a classical and fundamental problem in combinatorial opti- 
mization. In this problem, we are given as input a set of items each associated with a size and a 
profit, and our objective is to find a maximum profit subset of items whose total size is at most the 
capacity C of the knapsack. In many applications, the size and/or the profit of an item may not be 
fixed values and only their probability distributions are known to us in advance. The actual size 
and profit of an item are revealed to us as soon as it is inserted into the knapsack. For example, 
suppose we want to schedule a subset of n jobs on a single machine by a fixed deadline and the 
precise processing time and profit of a job are only revealed until it is completed. In the following, 
we use terms items and jobs interchangeably. If the insertion of an item causes the knapsack to 
overflow, we terminate and do not gain the profit of that item. The problem is broadly referred to 
as the stochastic knapsack (SK) problem [241 1131 [35] . Unlike the deterministic knapsack problem 
for which a solution is a subset of items, a solution to SK is specified by an adaptive policy which 
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determines which item to insert next based on the remaining capacity and and the set of available 
items. In contrast, a non-adaptive policy specifies a fixed permutation of items. 

A significant generalization of the problem, introduced in [35], considers the scenarios where 
the profit of a job can be correlated with its size and we can cancel a job during its execution in 
the policy. No profit is gathered from a canceled job. This generalization is referred as Stochastic 
Knapsack with Correlated Rewards and Cancelations (SK-CC). Stochastic knapsack and several of 
its variants have been studied extensively by the operation research community (see e.g., [25 j 126 1 l5j 
H]). In recent years, the problem has also attracted a lot of attention from the theoretical computer 
science community where researchers study the problems from the perspective of approximation 
algorithms [21 031 E5] . 

Our Results: For SK, Bhalgat, Goel and Khanna [13] obtained a (1 + e)-approximation using 
e extra capacity. We obtain an alternative proof of this result, using the Poisson approximation 
technique. The running time of our algorithm is n 

(/(e)) w here /(e) = 2^ ol ^ l / e \ improving upon 
the n°^^^° </<c)) running time in [13]. Our algorithm is also considerably simpler. 

Theorem 1.5. For any e > 0, there is a polynomial time algorithm that finds a (1 + e)- approximate 
adaptive policy for SK when the capacity is relaxed to (1 + e)C. 

Our next main result is a generalization of Theorem [L5] to SK-CC where the size and profit of an 
item may be correlated, and cancelation of item in the middle is allowed. The current best known 
result for SK-CC is a factor 8 approximation algorithm by Gupta, Krishnaswamy, Molinaro and 
Ravi [35], base on a new time-indexed LP relaxation. We remark that it is not clear how to extend 
the enumeration technique developed in [13J to handle cancelations (see a detailed discussion in 
Section [5]). 

Theorem 1.6. For any e > 0, there is a polynomial time algorithm that finds a (1 + e) -approximate 
adaptive policy for SK-CC when the capacity is relaxed to (1 + e)C. 

We use SK-Can to denote the stochastic knapsack problem where cancelations are allowed (the 
size and profit of an item are not correlated). Based on Theorem 11.61 and the algorithm in [12] . we 
obtain a generalization of the result in |12j as follows. 

Theorem 1.7. For any e > 0, there is a polynomial time algorithm that finds a (2 + e)- approximate 
adaptive policy for SK-Can. 

Bayesian Online Selection: The technique developed for SK-CC can be used to obtain the 
following result for an interesting variant of SK, where the size and the profit of an item are 
revealed before the decision whether to select the item is made. We call this problem the Bayesian 
online selection problem subject to a knapsack constraint (BOSP-KC). The problem falls into the 
framework of Bayesian online selection problems (BOSP) formulated in [42J. BOSP problems subject 
to various constraints have attracted a lot of attention due to their applications to mechanism design 
[36\ [161 121 E2]- BOSP-KC also has a close relation with the knapsack secretary problem [6] (See 
Section [6] for a discussion) . 

Theorem 1.8. For any e > 0, there is a polynomial time algorithm that finds a (1 + e) -approximate 
adaptive policy for BOSP-KC when the capacity is relaxed to 1 + e. 
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As a byproduct of our discretization procedure, we also give a linear time FPTAS for the 
stochastic knapsack problem where each item has unlimited number of copies (denoted as SK-U), 
if we relax the knapsack capacity by e. The problem has been studied extensively under different 
names and optimal adaptive policies are known for several special distributions \25\ [26j 0]. 
However, no algorithmic result about general (discrete and continuous) distributions is known 
before. The details can be found in Appendix iDl 

Our techniques: SK-CC and BOSP-KC are more technically interesting since their solutions are 
adaptive policies, which do not necessarily have polynomial size representations. So it is not even 
clear at first sight where to use the Poisson approximation technique. As before, we first discretize 
the distributions. In the second step, we attempt to enumerate all possible block- adaptive policies, 
a notion which is introduced in |13| . In a block-adaptive policy, instead of inserting them items one 
by one, we insert the items block by block. In terms of the decision tree of a policy, each node in the 
tree corresponding to the insertion of a block of items. A remarkable discovery in [13] is that there 
exists a block-adaptive policy that approximates the optimal policy and has only O(l) blocks in 
the decision tree (the constant depends on e) for SK. However, their proof does not easily generalize 
to SK-CC. We extend their result to SK-CC with a essentially different proof, which might be of 
independent interest. Fixing the topology of the decision tree of the block-adaptive policy, we can 
enumerate the signatures of all blocks in polynomial time, and check for each signature whether 
there exists a block-adaptive policy with the signature using dynamic programming. Again, in the 
analysis, we use the Poisson approximation theorem to argue that two block adaptive policies with 
the same tree topology and signatures behave similarly. 

1.4 Other Related Work 

Recently, stochastic combinatorial optimization problems have drawn much attention from the the- 
oretical computer science community. In particular, the two-stage stochastic optimization models 
for many classical combinatorial problem have been studied extensively. We refer interested reader 
to [57] for a comprehensive survey. 

There is a large body of literature on EUM and TPM, especially for specific combinatorial prob- 
lems and/or special utility functions. Loui [47] showed that the EUM version of the shortest path 
problem reduces to the ordinary shortest path (and sometimes longest path) problem if the utility 
function is linear or exponential. For the same problem, Nikolova, Brand and Karger [51] identified 
more specific utility and distribution combinations that can be solved optimally in polynomial time. 
Nikolova, Kelner, Brand and Mitzenmacher [52j studied the TPM version of shortest path when 
the distributions of the edge lengths are normal, Poisson or exponential. Nikolova [50] extended 
this result to an FPTAS for any problem for normal distributions, if the deterministic version of 
the problem has a polynomial time algorithm. Many heuristics for the stochastic shortest path 
problems have been proposed to deal with more general utility functions (see e.g., [48} I4U} 111]). 
However, either their running times are exponential in worst cases or there is no provable per- 
formance guarantee for the produced solution. The TPM version of the minimum spanning tree 
problem has been studied in |38|. 130], where polynomial time algorithms have been developed for 
Gaussian distributed edges. 

The bin packing problem is a classical NP-hard problem. It is well known that it is hard to 
approximate within a factor of 3/2 — e from a reduction from the subset sum problem. Alternatively, 
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the problem admits an asymptotic PTAS, i.e., it is possible to find in polynomial time a packing 
using at most (l + e)OPT + l bins for any e > [29] . The stochastic model where all items follow the 
same size distribution have been studied extensively in the literature (see, e.g., [201 [55]). However, 
these works require that the actual items sizes are revealed before put in the bins and their focus 
is to design simple rules that achieve nearly optimal packings. 

Kleinberg, Rabani and Tardos [H] first considered the fixed set version of the stochastic knap- 
sack problem with Bernoulli- type distributions. Their goal is to find a set of items with maximum 
total profit subject to the constraint that the overflow probability is at most a given parameter 7. 
They provided a polynomial-time 0(log l/7)-approximation. For exponentially distributed items, 
Goel and Indyk [31] presented a bi-criterion PTAS. Chekuri and Khanna [18] pointed out that 
a PTAS for Bernoulli distributed items can be obtained using their techniques for the multiple 
knapsack problem. For Gaussian distributions, Goyal and Ravi [32] obtained a PTAS. 

The adaptive stochastic knapsack problem and several of it variants have been shown to be 
PSPACE-hard [23], which implies that it is impossible in polynomial time to construct an optimal 
adaptive policy, which may be exponentially large and arbitrarily complicated. Dean, Goemans, 
and Vondrak [24] first studied SK from the perspective of approximation algorithms and gave an 
algorithm with an approximation factor of 3 + e. In fact, their algorithm produces a non- adaptive 
policy (a permutation of items) which implies the adaptivity gap of the problem, the maximum 
ratio between the expected values achieved by the best adaptive and non-adaptive strategies, is 
a constant. Using the technique developed for the (1 + e)-approximation using e extra capacity, 
Bhalgat, Goel and Khanna [13] also gave an improved (| + e)-approximation without extra capacity. 
Stochastic multidimensional knapsack (also called stochastic packing) has been also studied [23] 
[T3"l 18]. The stochastic knapsack problem can be formulated as an exponential-size Markov decision 
process (MDP). Recently, there is a growing literature on approximating the optimal policies for 
exponential-size MDPs in theoretical computer science literature (see e.g., [241 1341 [35] 142]). 

BOSP problems are often associated with the name prophet inequalities since the solutions of the 
online algorithms are often compared with "the prophet's solutions" (i.e., the offline optimum). The 
prophet inequalities were proposed in the seminal work of Krengel and Sucheston |43] and have been 
studied extensively since then. The secretary problem is a also classical online selection problem 
introduced by Dynkin [28] . Recently, both problems enjoy a revival due to their connections to 
mechanism design and many generalizations have been studied extensively [71 [6j [37] [151 ESI ESI EH 
[21 02]. We note that performances in all the work mentioned above are measured by comparing 
the solutions of the online policies with the offline optimum. Complementarily, our work compares 
our policies with the optimal online policies. 

Finally, we would like to point out that Daskalakis and Papadimitrious [21] recently used Pois- 
son approximation in approximating mixed Nash Equilibria in anonymous games. However, the 
problem and the technique developed there are very different from this paper. 

Prior Techniques: As mentioned in the introduction, it is a common challenge to deal with 
the convolution of a set of random variables (directly or indirectly). To address this issue, a 
number of techniques have been developed in the literature. Most of them only work for special 
distributions [33 ED E21 EU [33], [50] , such as Gaussian, exponential, Poisson and so on. There are 
much fewer techniques that work for general distributions. Among those, the effective bandwidth 
technique [41] and the linear programming technique [241 El ES] have proven to be quite powerful 
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for many problems, but the approximation factors obtained are constants at best (no exception is 
known so far). In order to obtain (multiplicative/additive) PTAS, two techniques are developed very 
recently: one is the discretization technique [13] for the stochastic knapsack problem and the other 
is the Fourier decomposition technique [45] for the utility maximization problem. However, both 
of them have certain limitations. The discretization technique |13] typically reduces a stochastic 
optimization problem to a complicated enumeration problem (in some sense, it is an O(logn)- 
dimensional optimization problem since the distributions are discretized into O(logn) equivalent 
classes). If the structure of the problem is different from or has more constraints than the knapsack 
problem, the enumeration problem can become overly complicated or even intractable (for example, 
the SK-CC problem or the TPM version of the shortest path problem). In the Fourier decomposition 
technique [45] . due to the presence of complex numbers, we lose certain monotonicity property in 
the reduction from the stochastic optimization problem to an deterministic optimization problem, 
thus it is impossible to obtain something like Theorem 1 1 . 2 1 using that technique. 

2 Expected Utility Maximization 

We prove Theorem 11.11 and Theorem 11.21 in this section. For each item b € B, we use 7T{,(.) to 
denote probability distribution of the weight of b. We use J- C 2 B to denote the set of feasible 
solutions. For example, in the minimum spanning tree problem, B is the set of edges and J- is the 
set of all spanning trees. Since we are satisfied with an e additive approximation, we can assume 
w.l.o.g. the utility function n(x) = if x > C for some constant C (e.g., we can choose C to be a 
constant such that fj,(x) < e if x > C). The support of 7r&(.) is assumed to be a subset of [0,C]. 
By scaling, we can assume C = 1 and < /x(x) < 1 for x > 0. We also assume fi is a-Lipschitz 
where a is a constant that does not depend on e. It is straightforward to extend our analysis to 
the case where a depends on e. We first consider the general EUM problem and then focus on the 
EUM-Mono problem where the utility function is monotone nonincr easing. 

We start by bounding the total expected size of solution S if /J,(S) is not negligible. This directly 
translates to an upper bound of the number of items with large expected weight in S, which we 
handle separately. The proof is fairly standard and can be found in the appendix. 

Lemma 2.1. Suppose each item b € B has a non-negative random weight Xb taking values from 
[0,oC)] for some a > 1. Then, \/S C U, V± > e > 0, ifE\p(X(S))] > e, then E [X(S)\ < 3a/ e. 

Let S* denote the optimal feasible set and OPT the optimal value. If OPT = E[//(X(S'*))] < e, 
then any feasible solution achieves the desired approximation guarantee since OPT — e < 0. Hence, 
we focus on the other case where OPT > e. We call an item b heavy item if E[-X&] > e 10 . Otherwise 
we call it light. By Lemma 12.11 we can see that the number of heavy items in S* is at most -fr • 

Enumerating Heavy Elements We enumerate all possible set of heavy items with size at most 3/e 11 . 
There are at most re 3 / 6 " such possibilities. Suppose we successfully guess the set of heavy items in 
S* . In the following parts, we mainly consider the question that given a set H of heavy items, how 
to choose a set L of light items such that their union S is a feasible solution, and E[/i(S*)] is close 
to optimal. 

Dealing with Light Elements Unlike heavy items, there may be many light items in S* , which 
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makes the enumeration computationally prohibitive. Our algorithm consists of the following steps. 
First, we discretize the weight distributions of all items. After the discretization, there are only a 
constant number of discretized weight values in [0, C]. The discretized distribution can be thought 
as a vector with constant dimensions. Then, we argue that for a set L of light items (with certain 
conditions), the distribution of the sum of their discretized weights behaves similarly to a single 
item whose weight follows a compound Poisson distribution. The compound Poisson distribution is 
completely determined by a constant dimensional vector (which we call the signature of L) which 
is the sum of the distribution vectors in L. The argument is carried out by using the Poisson 
approximation theorem developed by Le Cam [44]. Then, our task amounts to enumerating all 
possible signatures, and checking whether there is a set L of light items with the signature and 
H U L is a feasible set in T . Since the number of possible signatures is polynomial, our algorithm 
runs in polynomial time. Now, we present the details of our discretization method. 



2.1 Discretization 

In this section, we discuss how to discretize the size distributions for items, using parameter e. 
W.l.o.g., we assume the range of Xf, is [0, C] for all b. Our discretization is similar in many parts 
to the one in |13j . however, ours is much simpler. 

For item b, we say b realizes to a "large" size if Xb > e 4 . Otherwise we say b realizes to a "small" 
size. The discretization consists of two steps. We discretize the small size region in step 1 and the 
large size region in step 2. We use Xb to denote the size after discretization and nf, its distribution. 

Step 1. Small size region In the small size region, Xb follows a Bernoulli distribution, taking 
only values and e 4 . The probability values Pr[Xb = 0] and PrpQ, = e 4 ] are set such that 

E[X b \X b <e i ]=E[X b \X b <e% 

More formally, suppose w.l.o.g. that there is a value < d < e 4 such that Pr[X 6 > d \ X b < 
e 4 ] • e 4 = E[X b | X b < e 4 ]. We create a mapping between X b and X b as follows: 



X u 



0, < X b < d; 
e 4 , d < X b < e 4 ; 
X b , X b > e 4 . 



In the appendix, we discuss the case where such value d does not exist. 

Step 2. Large size region If X b realizes to a large size, we simply discretize it as follows: Let 
Xb = (i.e., we round a large size down to a multiple of e 5 ). 

The above two discretization steps are used throughout this paper. We denote the set of the 
discretized sizes by S = {so,si, . . . ,s^_i} where so = 0, si = e 5 ,S2 = 2e 5 ,S3 = 3e 5 , . . . ,s^_i. Note 
that si = e 5 , . . . ,si/ € _i = e 4 — e 5 are also included in S, even though their probability is 0. It is 
straightforward to see that |«S| = z = 0(C/e 5 ). This finishes the description of the discretization. 

The following lemma states that for a set of items, the behavior of the sum of their discretized 
distributions is very close to that of their original distributions. 

Lemma 2.2. Let S be a set of items such that E [X(S')] < 3/e. For any < (3 < C, we have that 
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1. Pt[X(S) <P}< Pt[X(S) < P + e] + O(e); 

2. Pt[X{S) < 0\ < Pr[X(S) < p + e] + O(e). 

Lemma 2.3. For any set S of items such that E [X(S')] < 3/e, 

E[fx(X(S))]-E[fM(X(S))] \ =0(e). 

Proof. For a set S, we use Ps{x) and Ps(x) to denote the CDFs of X(S) and X(S) respectively. 
We first observe that 



EMX(S))]--EMX(S))] 



fi(x)dP S (x) - / fl(x)dP S (a 

Jo 

P S (a;)d/x(x) - / P s (s)d/i( 3 
Jo 

(P s (x) -P s (x))dx 



< a 



The second equation follows from applying integration by parts and the last is because \i is a- 
Lipschitz. From Lemma 12.21 we can see that 



(P s (x) - P S (x))dx 



< 0(e). 



In fact, the above can be seen as follows: 



(P s {x) - P s {x))dx 



(P s (x)-P s (x + e))dx- P s (x)dx + 
o Jo 



Ps(x)dx < 0(e). 



□ 



The proof for the other direction is similar and we omit it here. 

2.2 Poisson Approximation 

For an item b, we define its signature to be the vector 

Sg(6) = (7r&(si),7fb(s 2 ),7f"b(s 3 ), . . . ,7f b (s 2 __l)), 

where 7r&(s) = |_^&( s ) ' ;prj ' f° r an nonzero discretized size s £ S \ {0} = {si,S2, . . . , s 2 _i}. For a 
set S of items, its signature is defined to be the sum of the signatures of all items in S, i.e., 

s g (s) = ]Tsg( ). 

bes 

We use Sg(S)k to denote the fcth coordinate of Sg(5). By Lemma 12.11 5Zfe=i Sg(*S')ft • s& = 
X^fc=i/ e Sg(5)fc • Sfc < 3/e. Thus Sg(S)k < 3/e 5 for all fc. Therefore, the number of possible 

l*S I 1 

signatures is bounded by (3n/e n ) , which is polynomial in n. 
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For an item b, we let X& be the random variable that Pr [X& = s] = 7f&(s) for s = si, S2, . . . , s z _i, 
and Xf, = with the rest of the probability mass. Similarly, we use X(S) to denote ^&es -^-b for 
a set S of items. 

The following lemma shows that it is sufficient to enumerate all possible signatures for the set 
of light items. 



Lemma 2.4. Let Si, 5*2 be two sets of light items such that Sg(Si) = Sg(S 2 ) and E X(S±) 
3/e,E X(S 2 ) < 3/e. Then, the total variation distance between X (Si) and X(S 2 ) satisfies 



< 



a(x(S 1 ),X(S 2 )) =5Z |Pr \X(S 



Pr 



X(S 2 



0(e) 



The following Poisson approximation theorem by Le Cam [33], rephrased in our language, is es- 
sential for proving Lemma [2.41 Suppose we are given a K-dimensional vector V = (V±, . . . , Vk)- Let 



A = YliLi Vi- we sa Y a random variable Y follows the compound Poisson distribution corresponding 



to V if it is distributed as Y = Ylj=i w here follows Poisson distribution with expected value 
A (denoted as N ~ Pois(A) ) and Yi, . . . , Ijv are i.i.d. random variables with Pr[lj = 0] = and 
Pr[Y,- = k] = V fc /A for k € {1, . . . , K} and j € {1, . . . , N}. 



Lemma 2.5. |44| Let X\, X 2 , . . . be independent random variables taking integer values in {0, 1, K}, 
let X = Y J X l . Let vr; = Prpf, / 0] and V = (Vi, . . . , V K ) where V k = £\ Pr[X; = k]. Suppose 
A = = Ylk < oo. Let Y be the compound Poisson distribution corresponding to vector V. 

Then, the total variation distance between X and Y can be bounded as follows: 



A[X,YJ = JjPrLX = k] - Pt[Y 



k] 



k>0 



Proof of Lemma \2.4\ By definition of X b , we have that A (Xb,Xf,j < e/n for any item b. Since S\ 
and S 2 contains at most n items, by the standard coupling argument, we have that 



A(X(S 1 ),X(S 1 )) <e 



and 



A(X(S 2 ),X(S 2 )) < e . 



If we apply Lemma [231 to both X(S\) and X(S 2 ), we can see they both correspond to the same 
compound Poisson distribution, say Y, since their signatures are the same. Moreover, since the 
total variation distance is a metric, we have that 

a(x(s 1 ),x(s 2 )) < a(x(s 1 ),x(s 1 )) +a(x(s 1 ),y 



+A (Y,X(S 2 ) + A (X(S 2 ),X(S 2 : 



< e 



X b yt0 



beS! 



bes 2 



X b ^0 



+ e 



0(e). 
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The last equality holds since for any light item b, 



Pr 



< Pl- 



ane! 



Pr 



X b >e 4 



< E 



X b 



/e 4 < e 6 



bes 



< E 



X{S) /e 4 <3/e 5 for S = S U S 2 . 



□ 



2.3 Approximation Algorithm of EUM 

Now, everything is in place to present our approximation algorithm and the analysis. 



Discretize the size distributions of for all light items; 
Enumerate all possible heavy item sets H with ELX~(iT)l < 3/e; 
for each such H do 

Enumerate all possible signatures Sg; 
for each such Sg do 

(a) Try to find a set L of light items such that H U L € T (H U L is feasible) and 

|_ Sg(L) = Sg; 

Pick the feasible H U L with the largest E[[i(X(H U L))] ; 



Algorithm 1: Approximation Algorithm of EUM 

In step (a), we can use the pseudopolynomial time algorithm for the exact version of the problem 
to find a set L with the signature exact equal to Sg. Since Sg is a vector with 0(e -5 ) coordinates and 
the value of each coordinate is bounded by O(n), it can be encoded by an integer which is at most 
n O(e J ) Thus th e pseudopolynomial time algorithm actually runs in poly (n,n°^ e ') = n°( e > 
time, which is a polynomial. Since there are at most N°( e ) different heavy item sets and n°( e > 



different signatures, the algorithm runs in n 



0(e" 15 ) 



n 



poly(lA) 



time overall. Finally, we present the 



analysis of the performance guarantee of the algorithm. 

Proof of Theorem Assume the optimal feasible set is S* = H* U L* where items in H* are 
heavy and items in L* are light. Assume our algorithm has guessed H* correctly. Since there is 
a pseudopolynomial algorithm, we can find a set L of light items such that Sg(L) = Sg(L*). By 
Lemma \2A\ we know that A(X(L), X(L*) I = O(e). Therefore, we can get that 



A[X(LUH*),X(L*UH*)) =0(e 



Moreover, we have that 



E 



H{X{LUH*)) 



E 



X(L*UH*))^ | < M(s) \kluh* (s) - ^L'VJH* (s 
sG5 



< m&x^nix) ■ A[X(LUH*),X(L* U H*) 



0(e), 
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where tt$ is the PDF for X(S). It is time to derive our final result: 

| E[n{X(L* U H*))] - E[n(X(L U H*))} \ < E[n(X{L* U H*))) - E[fi{X(L* U H*))] 

+ E[fj,(X(L* U H*)) — E[fi(X(L U H*))]] 
+ E[/j,(X(L U H*)) — E[[i(X(L U H*))]) 
= 0(e) 

The inequality follows from Lemma 12.31 and 12.41 □ 
2.4 Approximation Algorithm for EUM-Mono 

We prove Theorem 11.21 in this subsection. Recall that EUM-Mono is a special case of EUM where 
the utility function \x is monotone nonincreasing. The algorithm is the same as that in EUM except 
we adopt the new step ( follows. 



(a) Try to find a set L of light items such that H U L € J- (H U L is feasible) and 
Sg(L) < (1 + e 6 )Sg (coordinatewise); 



Lemma 2.6. We are given two vectors V\ < V2 (coordinatewise) . Y\ and Y% are random variables 
following CPD corresponding to V\ and V2, respectively. Then, Yi stochastically dominates Y\. 

Proof. We are not aware of an existing proof of this intuitive fact, so we present one here for 
completeness. The lemma can be proved directly from the definition of CPD, but the proof is 
tedious. Instead, we use Lemma f2.5l to give an easy proof as follows: Consider the sum X of a large 
number iV of nonnegative random variables {Xj}j=i ... jy, each X{ having a very small expectation. 
Suppose ^jSg(Xj) = V\. As N goes to infinity and each E[Xj\ goes to 0, the distribution of X 
approaches to that of Y\ since their total variation distance approaches to 0. We can select a subset 
S of {Xj}j = i v .. j 7v so that Ylies Sg(^i) = V2. So, the sum of the subset, which approaches to Y2 in 
the limit, is clearly stochastically dominated by total sum X. □ 

Lemma 2.7. Let Si, £2 be two sets of light items with E\X(Si)] < 3/e and E[X(S < 2)] < 3/e. // 
Sg(5'i) < (1 + e 6 )Sg(S , 2), then we have that for any /3 > 

Pr[A ? (S 1 ) <0\> Pt[X(S 2 ) < P) - 0(e). 

Proof. Let Z\ and Z2 be the compound Poisson distribution (CPD) corresponding to Sg(S'i) and 
Sg(S , 2), respectively. Denote A = ||Sg(52)||i- Let Y be the CPD defined as Y = J2iLi^i where 
iV ~ Pois((l + e 6 )A) and l^s are i.i.d. random variables with Pr[Yj = s^] = Sg ^ 2 ^ fc for each 
Sfc G S \ {so}. By Lemma [231 Z2 is distributed as Y^iLi^i where N' ~ Pois(A). By the standard 
coupling argument, we can see that 

A{Y,Z 2 ) < A(N,N') = A(Pois((l + e 6 )A), Pois(A)) = 0(e). 
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This is because Pr[N = k] = (1 + e 6 ) k e~ e x Pr[N' = k) for all k e N. Since A = ||Sg(S 2 )||i = 
0(e~ 5 ), (1 + e 6 ) fc e _e A > e _e A = 1 — 0(e). Therefore the total variation distance of iV and N' can 
be bounded by 0(e). 

Since Sg(Si) < (l+e 6 )Sg(52), Z\ (the CPD corresponding to Sg(S\)) is stochastically dominated 
by Y (the CPD corresponding to (1 + e 6 )Sg(S2)) by Lemma [2^61 Therefore, 

Pr[X(5i) <0\> Pr[Zi < /3] - A(X(Si), Zi) 

> Pv[Y < 13}- 0(e) 
>Pr[Z 2 <f3}-A(Y,Z 2 )-0(e) 

> Pt[X(S 2 ) <0\- A(X(S 2 ), Z 2 ) - 0(e) 

> Pr[X(S 2 ) <0\- 0(e) 

We also have a(x(S),X(S)\ < e for S = Si, S 2 . Thus 

Pr[X(S 1 ) <p}> Pv[X(S 2 ) <P\- 0(e). 



This completes the proof of the lemma. 



□ 



Proof of Theorem The proof is similar to the proof of Theorem 11.11 Assume the optimal 
feasible set is S* = H* U L* where items in H* are heavy and items in L* are light. Assume 
our algorithm has guessed H* correctly. Since there is a multidimensional PTAS, we can find a 
set L of light items such that Sg(L) < (1 + e 6 )Sg(L*). By Lemma 12.71 we know that for any 
P > 0, Pi[X(L) < P] > Pi[X(L*) < 13}- 0(e). Therefore, we can get that for any (3 > 0, 
Pi[X(L U H*) < f3}> Pr[X(L* U H*) < j3]- 0(e). Now, we can bound the expected utility loss for 
discretized distributions: 

E[^(X(L* U H*))} - E[ M (X(L U H*))} = £ (p L uh*(x) - P l *vjh*(x))Mx) 



Pluh* (x) - Pl*uh* (x) ) dx 



< a ■ 
= 0(e). 

Finally, we can show the performance guarantee of our algorithm: 

E[n(X(L* U H*))} - E[n(X(L U H*))} < E[^(X(L* U H*))} - K[fi(X(L* U H*))} 

E[fx(X(L* U H*))] - E[/i(X(L U H*)) 
E[n(X(L U H*))} - E\p,(X(L U H*))} 
= 0(e) 

The last inequality follows from Lemma 



+ 
+ 



□ 
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3 Stochastic Bin Packing 



Recall that in the stochastic bin packing (SBP) problem, we are given a set of items B = {bi, 62, ... , b n } 
and an overflow probability < p < 1. The size of each item bi is an independent random variable 
Xi. The goal is to pack all the items in B into bins with capacity C, using as few bins as possible, 
such that the overflow probability for each bin is at most p. The main goal of this section is to 
prove Theorem 11.31 

W.l.o.g., we can assume that p < 1 — e where e is the error parameter. Otherwise, the overflow 
probability is relaxed to p + e > 1, and we can pack all items in a single bin. Let the number of 
bins used in the optimal solution be m. In our algorithms, we relax the bin size to C + O(e), which 
is less than 2C. W.l.o.g., we assume the support oOQ is [0,2C]. Prom now on, assume that our 
algorithm has guessed m correctly. We use B\,Bi,..., B m to denote the bins. 

3.1 Discretization 

We first discretize the size distributions for all items in B, using parameter e, as described in 
Section 12.11 Denote the discretized size of bi by Xi . 

We call item bi a heavy item if E[Xj] > e 15 . Otherwise, bi is light. We need to further discretize 
the size distributions of the heavy items. We round down the probabilities of Xi taking each nonzero 
value to multiples of e 22 . Denote the resulting random size by Xi. More formally, PrpQ = s] = 
[PrpQ = s] • e~ 22 J • e 22 for any s G S \ {0}. Use % to denote the set of all discretized distributions 
for heavy items. We can see that \H\ = (e -22 )' 5 ' = (e~ 22 ) ( e 5 ). Denote them by ni,Il2, . . . 
(in an arbitrary order). 

For a set S of items, we use H(S) to denote the set of heavy items in 5, and use L(S) to 
denote the set of light items in S. We define the arrangement for heavy items in S to be the 
1% (-dimensional vector: 



where £ N is the number of heavy items in S following the discretized size distribution 11/%, 
k = 1, 2, ... , \H\. Suppose we pack all items in S into one bin. By Lemma I2.1l and the assumption 



slightly different from the previous one). The signature of the a set 5 of light items is defined to 
be Sg(5) = X^bes , Sg(6). If S consists of both heavy and light items, we use Sg(5) as a short for 
Sg(L(S)). Moreover, for set S, we define the rounded signature to be 



Ar(S) = (N 1 ,N 2 ,...,N m ) 




Sg(5) : Sg(5) fc = \Sg(S) k ■ e- 6 ] 



for each k = 1, . . . , |«S| — 1. 



Suppose we pack all items in S into one bin. Since E[X(5)] < 3/e, Sg(S)k < 0(l/e 5 ) for any k. 
Therefore, the number of possible rounded signatures is bounded by (3/e 11 )' 5 ' -1 . 

The configuration of a set S of items is defined to be Cf(S') = (Ar(5); Sg(5)). It is straightfor- 
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ward to see the number of all configurations is bounded by 



which is still a constant. 

We also define the s- configuration of a solution SOL = {Si, . . . , S m } (S m is the set of items 
packed in bin Bi) to be 

Cf(SOL) = {Cf(5i),Cf(5a),...,Cf(^ m )}. 

We note that Cf(SOL) is a multi-set (instead of a vector), i.e., the indices of the bins do not matter. 
Hence, the number of all possible s-configurations is bounded by ("^ ) = poly(m). Let C be the 
set of all possible s-configurations. 



3.2 Our Algorithm 

Before describing our algorithm, we need a procedure to solve the following multi-dimensional 
optimization problem: We are given an s-configuration 

Cf = ((Ari, Sg x ), (Ar 2 , Sg 2 ), . . . , (Ar m , Sg m )) , 

Our goal is to find a packing SOL = (Si, . . . , S rn ) such that Ar(S'j) = Ar« and Sg(S'j) < Sgj + e 6 l 
(where 1 = (1, . . . , 1)) for 1 < i < m or to claim that there is no packing SOL = (Si, . . . , S m ) such 
that Ar(S'j) = Ar^ and Sg(Sj) < Sgj for 1 < i < m. If we succeed in finding such a solution, we say 
Cf passes the test of feasibility, otherwise we say Cf fails the test. 

Finding a solution such that Ar (Si) = Cf j for all i is trivial. Now, we concentrate on the set of the 
light items, L(M). In fact, the problem for light items becomes a variant of the multidimensional 
bin packing problem, called the vector scheduling problem, which has been studied in |17| . For 
completeness, we sketch their approach, using our notations. We write a linear integer program, 
solve its LP relaxation and then round the solution to a feasible packing. We use the Boolean 
variables Xij to denote whether the light item bi is packed into bin Bj. We have the integrality 
constraints € {0, 1} and they are relaxed to > in the following LP relaxation: 

1- £7=i^ = i, ieL(M), 

2 - EieL(B) ^( s fc) ' x ij ^ ( S Sj)k, j e {1, • • • , m}, k G {1, . . . , |<S| - 1}, 
3.Xij>0, i E L(M), j 6 {l,...,m}. 

The following proposition states a well-known property for any basic solution of the LP. 

Proposition 3.1. Any basic feasible solution to the LP has at most (\S\ — 1) • m light items that 
are packed fractionally into more than one bins. 

If the above LP has no feasible solutions, we say Cf fails the test. Otherwise Cf passes the test, 
and we find a solution SOL as follows. First, we solve the LP and obtain a basic feasible solution. 
Let F be the set of light items that are fractionally packed into more than bins. By proposition 
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for m <- 1 to n do / / guess the number of bins in OPT 
for each s- configuration Cf = (Cfi, . . . , Cf m ) G C do 

if Pr(Cfj, (1 + 0(e))C) < p + 0(e), Vj and Cf pass the feasibility test then 
Return the solution S obtained from the testing algorithm; 

Algorithm 2: Stochastic Bin Packing 



13.11 \F\ < (\S\ — l)-m. We partition F arbitrarily into m subsets, each containing at most (|<S| — 1) 
items, and then pack the k-th subset into the k-th bin. Since the expected size of a light item is 
less than e 15 , 7Tj(sfc) < e 11 for any i, k. Therefore, Sg(Sj) < Sgj + (|«S| — l)e u = Sgj + e 6 l for any j. 
We need one more notation to describe our algorithm. For a set of items S, let 

Pr(Cf(5),C) = Pr((Ar(5),Si(5)),C) = Pr[X H + Y L >C) 

where Xh = X(H(S)) and Yl is the CPD corresponding to Sg(S) (according to Lemma 12.5ft . By 
definition, if two sets Si and S2 have the same configuration, Pr(Cf (Si), C) = Pr(Cf(S < 2), C). 

Now, everything is ready to state our algorithm. We simply enumerate all s-configurations in 
C. For each s-configuration Cf, we first compute Pr(Cf (S), C). If it is at most p + 0(e), we run the 
feasibility test. If Cf passes the test, the returned solution is our final packing. The pseudocode of 
our algorithm is described in Algorithm [2j 

The algorithm clearly runs in polynomial time since the number of s-configurations is polynomial 
and the feasibility test also runs in polynomial time. 



3.3 Analysis 



The following lemma shows that we can approximate the overflow probability of a bin given its 
configuration. Therefore, it is sufficient to enumerate all possible configurations for each bin to find 
an approximation solution. 

Lemma 3.2. For any set S consisting of at most 3/e 16 heavy items, we have that 



Pr(Cf(5),C) -Pr[X(S) > 



0(e). 



Proof. Let Xh = X(H(S)) and Yl be the CPD corresponding to Sg(5). Since there are at most 
3/e 16 heavy items, by the coupling argument, 



A(X H ,X(H(S))) < 3/e 16 • \S\ ■ e 22 = 0(e). 
Let Yl be the CPD corresponding to Sg(5). By Lemma 12.41 we can see that 

A(Y L ,X(L(S)))=0(e). 
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We can also show A(? L ,Y L ) = 0(e) since || Sg(5) - Sg(5) L = 0(e 6 ). Therefore, 

A(x^ + y L ,!(s)) < a(x h ,x(f(5))) +A(y L ,y L ) + a(yl,x(l(S))) =o(e) 

This finishes the proof of the lemma. □ 

The following lemma shows that we can approximate the overflow probability even if only an 
approximate signature is given. 

Lemma 3.3. For any two sets Si, S 2 such that Ar(Si) = Ar^) and Sg(Si) < Sg(S , 2) + 3e 6 l, we 
have that 

Pr[X(5i) > C] < Pr[X(S 2 ) > C] + 0(e). 

Proof. Let Sg 5 = Sg(S 2 ) + 3e 6 l. Let X H = X(H(Si)) = X(H(S 2 )) (since Ar(Si) = Ar(5 2 )). Let 
Y\,Y 2 be the CPD corresponding to Sg(Si) and Sg(/f>2)> respectively. Let Z,Z\,Z 2 be the CPD 
corresponding to Sg s , Sg(5i) and Sg(S 2 ), respectively. Since || Sg(S^) — Sg(Si) || = 0(e 6 ), we have 
that A(Yi, ZA = O(e) for i = 1, 2 (The proof is almost the same as that of Lemma [2.41 and omitted 
here). By Lemma we have that | Pr(Cf (#),€) - Pr[X(S , i ) > C] | = O(e) for i = 1,2. Since 
|| Sg s — Sg(S < 2) II = 3e 6 , we have that A(Z, Z2) = 0(e). Combining these facts together, we have 
that 

Pr[X(S 2 ) >C]> Pr[X H + Y 2 > C] - 0(e) > Pr[X H + Z 2 > C] - O(e) 
>Pr[X H + Z>C] -0(e). 

On the other hand, Sg(5i) < Sg s . So, by Lemma \2.6\ Z\ is stochastically dominated by Z. 
Therefore, we have that 

Pr[l(Si) > C] < Pr[X H + Y x > C] + 0(e) < Pr[X H + Zi > C] + 0(e) 
<Pr[X H + Z>C] +0(e). 

Combining these two results, we complete the proof. □ 

Now, we are ready to prove the main theorem of this section. 

Proof of Theorem li.ffi Suppose the optimal solution OPT = (Oi,...,O m ) uses m bins. The 
algorithm will enumerate its s-configuration Cf(OPT). Obviously, Cf(OPT) can pass the feasibility 
test. Let SOL = (Si,...,S m ) be the solution obtained by the LP rounding procedure, which 
guarantees that Sg(Sj) < Sg(Oj) + 3e 6 l, for any j. By Lemma l3~3l and Lemma 12.21 for 1 < j < m, 

Pr[X(Sj) > (1 + 0(e))C] < Pr[X(^) > (1 + 0(e))C] + 0(e) < Pr[X(0^) > (1 + 0(e))C] + 0(e) 

< Pv[X(Oj) > C] + 0(e) <p + 0(e). 

The proof of the theorem is completed. □ 
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Run Algorithm [2] to get a packing Si, S2, ■ ■ ■ , S m ; 
for each bin Bi do 

Pack each item in Bi into a new bin Bij; 

while there exist two bins Bij 1 , Bij 2 that P{X(Bij 1 U Bij 2 )) < (1 — e)p do 
(a) |^ Merge Bij 1 and B{j 2 into one bin; 

Algorithm 3: Stochastic Bin Packing without Relaxing the Overflow Probability 



3.4 A 3- Approximation without Relaxing the Overflow Probability 

For SBP, we can find a 3-approximation within polynomial time without relaxing the overflow 
probability p, for any constant < p < 1. First, we note that for any set of items, we can estimate 
the overflow probability by using the technique for counting knapsack solutions [27]. In fact, since 
we assume p is constant, we can simply use the Monte Carlo method to get an estimate with 
additive error e with high probability by randomly taking O(logn) samples. For each item set S 
we use P(S) to denote the estimate probability of PrLY(S') > C + e] Then, we have 

(a) (1 - e) Pt[X(S) > C + e] < P(S) < (1 + e) Pi[X(S) > C + e] 

with high probability (1 — i for some constant c) when Pr[A(S) > C + e] > p. 

We first run Algorithm [2] and obtain a packing B\ , . . . , B m where Si is the set of items in Bi 
and m < OPT. We know that Pi[X(Bi) > C + e] < p + e. Then, for each Bi, we distribute the 
items in Bi to at most 3 new bins such that the overflow probability is not at most p for each 

2 

new bin. Let e be any constant less than fq^- The pseudo-code of our algorithm is described in 
Algorithm [3J 

Proof of Theorem \l-4\ Now we claim that for each Bi, the while loop (a) produces at most 3 new 
bins. The approximation factor of 3 follows immediately since m < OPT. 

Now, we prove the claim. For each bin Bij output by the algorithm, either it packs only one item 
with overflow probability at most p (otherwise, there is no feasible solution), or P(Bij) < (1 — e)p. 
Therefore, the true overflow probability of Bij is at most p. 

We still need to show the while loop terminates with at most 3 bins. Suppose for contradictor 
that it outputs at least 4 bins, and B^i, Bi^, -B«,3, -B«,4 are four of them. Then, P(Bi : \U B^) > (1 — 
e)p. Therefore, Prpf (B^UB^) > C+e] > (l-2e)p. Similarly, Pi[X(B i:3 \JB iA ) > C+e] > (l-2e)p. 
Thus, we have that 

Pr[X(Bi) > C + e] > 1 - (1 - (1 - 2e)p) 2 >p + e 

2 

for any e < f^fr- This contradicts to the fact that Pr[X(Bi) >C + e]<p + e. □ 



4 Stochastic Knapsack 

An instance of the stochastic knapsack problem can be specified by a tuple (7r,C), where it = 
{7Ti,7T2, ...,-K n }. 7Tj is the joint distribution of size and profit for item bi, Vi. C is the capacity of 
the knapsack. W.l.o.g., we can assume \ < C < 2 and the size of each item is distributed between 
and 2C. The relaxed knapsack capacity C + 0(e) should be less than 2C. The distributions for 
different items are mutually independent. We let random variables X\, and P\, denote the size and 
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profit of item b. We use 7r& to denote the probability distribution of X b , i.e., iTb(w) = Pr[.X& = w]. 
W.l.o.g., we can assume that for each item, we obtain a fixed profit for each realized size. For each 
item b, we define the effective profit function H Pb to be 

p b (w) = E[P b \X b = w\- Pr[X 6 = w] for any w € [0, C]. 

We use the shorthand notation pb(I) = J2 we j Pb(w) ^ or an ^ ^ — i^i^l- 

Policies: The process of applying a policy cr on an instance (ir, C) can be represented as a decision 
tree T(o~,n,C). Each node v in T a corresponds to placing an item in the knapsack. Each edge 
e = (v,u) in T a (v is the parent) corresponds to a size realization of v. We use w e , ix e to denote 
the corresponding size and probability of e, respectively. We also use T a to denote T(o~, tt, C) when 
the context is clear. 

We call the path from root to v in T a the realization path of v, and denote it by R(v). For a node 
v, we denote the occupied capacity before inserting v as W{v) = Y^eeRM w e ana - ^ ne probability of 
reaching v as $>(v) = Yl e &R(v) 71 e - Denote by R a the random set of items that a packs. 

We use P(<r, tt, C) to denote the expected profit that the policy a can obtain with the given 
distributions tt and total capacity C. We also use the shorthand notation P(<r) or P(T cr ) if the 
context is clear. Recursively define the expected profit of the subtree T v rooted at v to be 

P(V)= [p v (w e )+TT e -¥( U ) 

e=(v,u)\W(v)+w e <C 

The expected profit P(cr) of policy a is simply P(the root of T a ). We use OPT to denote the 
expected profit of the optimal adaptive policy. We note that in some steps of our algorithm, we 
assume the knowledge of OPT. In fact, any constant approximation of OPT, which for example can 
be obtained using the approximation algorithm in [23] for SK or the one in [35] for SK-CC, would 
suffice for our purpose. 

As we mentioned before, the problem is PSPACE-hard and the optimal policy may be expo- 
nentially large. In order to reduce search space, we need to focus on a very special class of policies, 
in which it is possible to find a nearly optimal policy efficiently and this policy is also close to the 
optimalpolicy for the original problem. We start with some simple properties shown in Bhalgat et 
al. [13] ft W.l .o.g., we also assume that all (optimal or near optimal) policies a considered in this 
paper have the following property: 

PI. For u, v £ T a , if u is an ancestor of v, then P(u) > ¥(v). 

Otherwise, replacing the subtree T u with T v increases the profit of the policy a. This also implies 
that for any v G T a , P(u) < P(<r). 

Lemma 4.1 (part of Lemma 2.4 in [13] ). For any policy a on instance (ir,C), there exists a policy 
a' such that P(a', tt, C) = (1 - 0(e))P(<r, vr, C) and 



3 We find the effective profit function easier to work with than the profit function when the size and the profit are 
correlated. 

4 In fact, they only considered the basic version of stochastic knapsack where the profit is a fixed value for an 
item. However, a scrutiny of their proofs shows that correlated profits do not affect the properties. 
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P2. for any realization path R in T(a',ir,C), *^2 ve R^[X v ] = 0(C/e). 

4.1 Discretization 

In this section, we discuss how to discretize the size and profit distributions for items in B, using 
parameter e. W.l.o.g., we assume that the range of is [0, 2C) for any item b. The discretization 
of the size distributions is the same as the one in Section 12.11 We also need to discretize the profit 
distributions. For each item b and w € [0, 2C], we use Db(w) to denote the discretized size of w 
for item b, i.e., Db(w) is the value of X\, for Xf, = w. The discretized effective profit function p& is 
defined to be 

Pb{s) = ^2 Pb(w), for all s € S. 

w\Di,(w)=s 

This finishes the description of the discretization step 

We need the notion of canonical policies introduced in [13] . A policy a is a canonical policy if 
it makes decisions based on the discretized sizes of items inserted, rather than their actual sizes. A 
canonical policy stops inserting items when the total discretized size of items inserted exceeds the 
knapsack capacity C. Before that, it attempts to insert items even if the total actual size overflows. 
No profit from those items can be collected. In this following lemma, we show it suffices to only 
consider canonical policies. The proof is similar to that of Lemma A. 5 in [13] , which can be found 
in the appendix. Due to the presence of the correlations between profits and sizes, we need to be 
more careful in bounding the profit loss. 

Lemma 4.2. Let it be the joint distribution of size and profit for items in B and n be the discretized 
version of it. Then, the following statements hold: 

1. For any policy a , there exists a canonical policy a such that 

F(a, k, (1 + 4e)C) = (1 - 0(e))F(a, vr, C); 

2. For any canonical policy a, 

P(of, 7T, (1 + 4e)C) = (1 - 0(e))P(or, Sr, C). 

4.2 Block- Adaptive Policies 

To further reduce the search space, Bhalgat et al. [13] discovered a very specific class of canonical 
policies, called block- adaptive policies and showed it is sufficient to restrict the search to this set if 
we are satisfied with a nearly optimal policy. In a block-adaptive policy, instead of inserting one 
item at a time, we insert a set of items together each time. This set of items is called a block. 
A block-adaptive policy a can also be thought as a decision tree where each node in the tree 
corresponds to a block. Each edge incident on a vertex corresponds to a realization of the sum of 
the discretized sizes of all items in the block. 

It has been shown in [13] that for SK, from an optimal (or nearly optimal) adaptive canonical 
policy cr, we can construct a block adaptive policy (with some other nice properties), from which 
we can get almost as much profit as from <r, as in the following Lemma. 
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Lemma 4.3. A canonical policy a with expected profit (1 — e)OPT can be transformed into a 
block- adaptive policy with expected profit (1 — 0(e))OPT when the capacity is further relaxed by eC. 
Moreover, the block-adaptive policy satisfies the following properties: 

Bl. There are 0(e -14 ) blocks on any root-leaf path in the decision tree. 

B2. There are \S\ = 0(C/e 5 ) children for each block. 

B3. Each block M with more than one items satisfies that YlbeM ^ [^b] < £l3 - El 

In Section [5 -H we provide a proof for the generalization of the above lemma to SK-CC. 

4.3 Poisson Approximation 

To search for the (nearly) optimal block-adaptive policy, we want to enumerate all possible struc- 
tures for a block. In |13j . this is done by enumerating all different combinations of the profit 
contributions from O(logra) equivalence classes of items, using the technique developed in |18| . 
Instead, we enumerate all possible signatures of a block, similar to what we have done in Sectional 
Since we consider correlated profits and sizes, a signature needs to reflect the profit distribution as 
well as the size distribution. Formally, for an item b, we define the signature of b to be 

Sg(&) = (Wso),P&(si), • • • ,p 6 (s z _i); vffe(si),7f b (s 2 ), • • • , 7f 6 (s z _i)V 

where p b (s) = [pb(s) ■ e i S Q PT \ ■ and 7f 6 (s) = |_7r fc (s) • J^J • ^ for any s € S. For a block M 

of items, we define the signature of M to be 

Sg(M) = £ 66M Sg(6). 

We denote the "zero size probability" of M to be vf^ = Pi[X(M) = 0] . 

The following lemma shows that it is sufficient to enumerate all signatures for all blocks in a 
block-adaptive policy. 

Lemma 4.4. Consider two decision trees T\,T2 corresponding to block- adaptive policies with the 
same topology (i.e., T\ and T<i are isomorphic). If for each block M\ in T\, the block M2 at the 
corresponding position in T2 satisfies that Sg(Afi) = Sg(Af2), then |P(Ti) — F(T2)| = 0(e)OPT. 

Before proving Lemma |4.4[ we need to prove the following result. 

Lemma 4.5. Suppose the capacity of the knapsack is C < C. Let M\,Mi be two blocks with the 
same signature Sg M . Let a b be a block adaptive policy in which M\ is the root block. Let P(Mi) be 
the expected profit we can get from M\ with a knapsack capacity C Then, replacing M\ with M2 
in T a b incurs a expected profit loss of at most O(e 18 )0PT + 0(e 9 )P(Mi). 

5 In fact, this property was not explicitly mentioned in Bhalget et al. |13| . But it can be concluded from the fact 
that M has total profit at most 2e 14 OPT and each item has a profit density at least eOPT. In our alternative proof 
provided in Section f5. 11 we do not need the notion of profit density. 
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Proof. For ease of notation, we use T\ to denote T a b and T 2 the tree obtained by replacing M\ with 
M 2 . Let P(M 2 ) be the expected profit we can get from M 2 with a knapsack capacity C. First, we 
show the following two useful results. 

(a) |P(Mi) - P(M 2 )| = <3(e 18 ) = O(e 18 )0PT + 0(e 9 )P(Mi). 

(b) a(x(M 1 ),X(M 2 )) = 0(e 18 ). 



It is straightforward to verify the above results for the case where both M\ and M 2 have only one 
item, from the definition of signatures. 

Now, we focus on the case where M\ has more than one items. The case where M 2 has more 
than one items is the same. By Lemma 14.31 B3. E[X(Mi)] = YlbeMi ^l^b] < 2e 13 . Then we have, 

E[X(M 2 )) =E(E ^(s)) - s<^(^ ^(s) + e 23 ) •s<E[X(M 1 )] + 0(e 18 )<3e 13 . 

s>0 feeM 2 s>0 beMx 



> 1 - 2e 9 , and n% h > 1 - 3e 9 . 

'or any item b £ Mi, with probability at least 



By Markov's inequality, ir° Mi = 1 - Pr X(Mi) > e 4 
Suppose we insert the items in Mi one by one. 
TVf^i, the remaining capacity before inserting b is C (all previous items realized to zero size). So 
the expected profit we can get from b is at least 7?^ • Pb([0, C']) . Thus 



P(M0 > ^ £ p 6 ([0,C']) = n° Ml £ £ p 6 (s). 

6eMi 0<s<C 6eA/i 

We also have that 

F(M 1 )< £p 6 ([0,C']) = £ £ft(s). 

beMi 0<s<C bGA/i 

Similarly, we can show that 

*k E E^( s )^ p ( M ^ E E^( s )- 

0<s<C feeA/ 2 0<s<C b£AI 2 

Since Sg(Mi) = Sg(M2) = Sg M , we have that 

I E E^( s )- E E^( s )| = °( el8 ) 0PT - 

0<s<C b€Ah 0<s<C b£M 2 

Linking these inequalities together, we obtain (a). 

On the other hand, since E[X(M X )] < 2e 13 , E[X(M 2 )] < 3e 13 and vf A /i(s) = 7f M2 (s) for any s, 
we can show (b) holds also by following the same proof as that of Lemma 12.41 which we do not 
repeat here. 

Let v s be the child of Mi corresponding to size realization s, and T s be the subtree rooted at 
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v s . Given (a) and (b), we have that 



P(T 2 ) = P(M 2 ) + ^Pr[x(M 2 ) = s]p(T s ) 

s 

> P(Mi) - 0(e 18 )OPT - 0(e 9 )P(Afi) 

+ J2 Fl [^( M = s ] P ( T s) - A (x(Mi),X(M 2 )) • maxP(T s ) 

s 

= P(Ti) -O(e 18 )0PT-O(e 9 )P(Mi). 
The last inequality holds since PI: max s P(T s ) < OPT. □ 

Proof of Lemma \4-4[ We replace all the blocks in T a b by the corresponding ones in T a b . By 
Lemma 14,5} the total profit loss is at most 



HM) [O(e 18 )0PT + 0(e 9 )¥ M ] 

M 

= O(e 18 )^$(M)0PT + O(e 9 )^$(M)P A f 

M M 

= O(e 18 )^$(M)0PT + O(e 9 )P(^) 

M 

< O(e)0PT 

The last inequality holds because P(cjJ) < OPT and the depth of T a t is 0(e~ 14 ), thus YjM ®( M ) = 
0(e- 14 ). ' □ 

The number of possible signatures for a block is (0(n/e 23 )^°^ e ^ = n vo[y ^ 1 /' L \ which is a 

polynomial of n. For any block adaptive policy a b , there are at most /(e) = (0(e~ 5 )) ^ e ^ = 
2P ol y( 1 / <E ) blocks in its decision tree, since the height of the tree is 0(e -14 ) and the branching factor 
is at most |5| = 0(e -5 ). Therefore, the number of all topologies of the decision tree is a constant. 



4.4 Finding a Nearly Optimal Block- Adaptive Policy 

We have shown it suffices to enumerate over all topologies of the decision trees along with all 
possible signatures for each block (the number of all possibilities 

i s n O(/(e)) _ re 2P°Mi 

) in order 

to find a nearly optimal block-adaptive policy. Now, we show how to find a nearly optimal block- 
adaptive policy with a given tree topology along with the signatures for all blocks, using dynamic 
programming. 

The dynamic program is fairly standard and we present a sketch here. Assume the tree topology 
has been fixed. A configuration C in the dynamic program is a set of signatures, each corresponding 
to a block in the tree. As we have shown, the number of configurations is poly(n) . We use DP(i, C) = 
1 to denote the fact that we can reach configuration C using a subset of {b±, . . . Otherwise, 
DP(i,C) = 0. Initially, DP(0,0) = 1. We compute all DP(i,C) values in an lexicographically 
increasing order of (i, C). The value of DP(z, C) can be computed from the values of DP(i — 1, C) 
for all C < C (coordinatewise). In fact, this step can be done as follows. Suppose we want 
to compute DP(i,C). We can decide to place item b{ in a few blocks in the decision tree. The 
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constraint here is no two blocks where we place bi have an ancestor-descendant relationship. Since 
the size of tree is /(e) = (l/e 5 )°( 1//<E so the number of possible ways of adding item bi is 
which is (still) a constant. For a particular placement of bi, we subtract the contribution of bi from 
configuration C (i.e., subtract Sg(6j) from the vectors in C corresponding to the blocks where we 
place bi), resulting another configuration C. We let DP(z, C) ^— max(DP(i, C), DP(i — 1, C)). Since 
the number of tree topologies is a constant, the number of configurations is and computing 
each DP(i, C) values takes a constant time, the overall running of our algorithm is 0{n^ e '), which 
improves upon the n°(f( € ))° running time in |13j . 

Now we have all necessary components to show the main theorem of this section. 

Proof of Theorem 11.51 ' Suppose a* is the optimal policy. The optimal expected profit is denoted 
as P(<7*,7r,C) = OPT. Given an instance (vr,C), the first step is to compute the discretized 
distribution tt. Then we use the dynamic program to find a nearly optimal block adaptive policy 
a for (tt, (1 + 5e)C). By result 1 of Lemma 14.21 there exists a canonical policy a such that 

P(a, tt, (1 + 4e)C) > (1 - 0(e))P(<r*, tt, C) = (1 - O(e))0PT. 

By Lemma l43| there exists a block adaptive policy a b such that F(a b , tt, (l+5e)C) > (1 — O(e))0PT. 
Since the configuration of a b is enumerated at some step of the algorithm, our dynamic program 
is able to find a block adaptive policy a with the same configuration (the same tree topology and 
the same signatures for corresponding nodes). By Lemma 14.41 we can see that 

P(<r,7r, (1 + 5e)C) > (1 - 0(e))F(a b ,n, (1 + 5e)C) > (1 - O(e))0PT. 

By result 2 of Lemma 14.21 

P(<r, tt, (1 + 4e)(l + 5e)C) > (1 - 0(e))P(<r, Sr, (1 + 5e)C) > (1 - O(e))0PT. 

Hence, the proof of the theorem is completed. □ 

5 Stochastic Knapsack with Correlations and Cancelations 

Recall in the stochastic knapsack problem with correlations and cancelations (SK-CC), we can 
cancel a job in the middle and we gain zero profit from a canceled job. If we decide to cancel job b 
after running for t time units, job b can be thought as a job b l with running time Xj,t = min{Xb, t}, 
where Xj, is the processing time of b. The effective profit of the new job p&(x) equals Pb(x) if x < t 
and if x > t. Since we consider discrete time distributions, it only makes sense to cancel a job 
after a discrete point with nonzero probability mass. Suppose the size of the support of each time 
distribution is bounded by m. Therefore, for each job b, we can use a set of m jobs to represent 
all possible cancelations of b, and in each realization path, we are allowed to choose at most one 
job from the set. In fact, we solve the following more general problem. We have n sets of items, 
Mi,i = 1, 2, n. Each set Bj consists of several items bn, bi%,.... Our goal is to find a policy 
that packs at most one item from each item set Bj to the knapsack with capacity C such that the 
expected profit is maximized. We call this problem the generalized stochastic knapsack problem, 
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denoted as GenSK. 

It is not clear how to use the technique in Bhalgat et al. [13] to handle the above problem. 
In fact, they discretize all (size) probability distributions into q = O(logn) equivalent classes and 
enumerate all combinations of the profit contributions from different classes in each block. For this 
purpose, they adopt the technique developed in [18] to reduce the number of combinations to a 
polynomial. Once the profit contribution from a class to a block is fixed, the actual set of items 
assigned into that block can be easily determined in a greedy manner (items with larger profit 
should be packed into the blocks that are closer to the root). However, in SK-CC, Bj may contain 
items from several classes. Assigning an item in B, to a block would prevent us from assigning 
any other items in Bj to the same block and its descendants. Such dependency makes the item 
assignment very complicated (for a fixed combination) and it is not clear to us how to do this in 
polynomial time. However, our technique can be easily extended to SK-CC. 

5.1 Block- Adaptive Policies 

In this section, we show that Lemma 14.31 also holds for GenSK (thus also for SK-CC). We note 
that the proof in [13] does not generalize to GenSK and we need to modify the proof in some 
essential way. We note that our proof works even when there are arbitrary precedence or cardinality 
constraints imposed on the items. In fact, any realization path of the constructed block-adaptive 
policy corresponds to some realization path of the original policy a. The idea of the proof may be 
useful in showing the existence of nearly optimal block- adaptive policies for other problems, thus 
may be of independent interest. 

Proof. For any node v in the decision tree T$, we define the leftmost path of v to be the realization 
path which starts at v, ends at a leaf, and consists of only edges corresponding to size zero. For 
any node v and size s G 5, we use v s to denote the s-child of v, that is the child of v corresponding 
to the size realization s. We define the segment starting with node v (denoted as seg(u)) in as 
the maximal prefix of the leftmost path of v such that: 

1. If E[X(v)} > e 13 , seg(u) is the singleton node {v}. Otherwise, E[X(seg(v))} < e 13 ; 

2. For any two nodes u,w € seg(v), and any size s, |P(T„J — P(T Wg )| < e 5 OPT. 

We partition into segments as follows. We say a node v is a starting node if v is a root or 
v corresponds to a non-zero size realization of its parent. For each starting node v, we greedily 
partition the leftmost path of v into segments, i.e., delete seg(t> ) and recurse on the remaining part. 
Fix a particular root-to-leaf path R. Let us bound the number of segments on R. Suppose we are 
at node u, which is a node in seg(v), and the next node we are about to visit is w which is not in 
seg(u). We know that one of the following events must happen: 

1. w corresponds to a non-zero size realization of u; 

3. For some size s, |P(T„ S ) -F(T Wb )\ > e 5 OPT. 
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The first event happens for at most 0(C/e 4 ) times. Since ^2 veR K[X(v)] = 0(C/e) by Lemma \$~T\ 
the second event happens at most 0(e -14 ) times. Suppose R = {v l ,v 2 , . . . ,v k }. W.l.o.g., we can 
assume that for each size s, OPT > F(T v i) > P(T V 2) > . . . > P(T v k) > by a simple substitution 
argument similar to PI. For each particular size s, the third event occurs for 0(e -5 ) times. Since 
there are at most \S\ = 0(e -5 ) different sizes, we need at most |<S| -e~ 5 = 0(e _1 °) parts. This gives 
a bound 0(e -14 + e~ 10 + e~ 4 ) = 0(e~ 14 ) on the number of segments on each root-leaf path. 

Now, we are ready to describe the algorithm, which takes a canonical policy a as input, and 
packs items into the knapsack in a block-adaptive way. For any segment seg(f), we use l(v) to 
denote the last node in seg(f). Similar to the argument in |13| . we use two knapsacks, the main 
knapsack with capacity C and the auxiliary knapsack with capacity eC. 

1. Initially, S = 0. S represents the auxiliary knapsack. 

2. We start at the root of 
repeat 

3.1 Suppose we are at node v in Tj. Add the items in seg(v) to the main knapsack one 
by one until some node u realizes to a nonzero size, say s. 

3.2 Add all remaining items in seg(f ) to the auxiliary knapsack S. 

3.3 Visit node l(v) s , the s-child of the last node of seg(v). 

3.4 If all nodes in seg(v) realize to size 0, visit l(v)o. 
until A leaf in T~ is reached. 

4. If the auxiliary knapsack overflows, discard the entire profit. 

We can see that the set of items the algorithm attempts to insert always corresponds to some 
realization path in the original policy a. Hence, the algorithm packs at most one item from each 
item set B^,. We have shown that Bl holds. Properties B2 and B3 are straightforward from the 
definition of segments and the algorithm. 

Now we show that the expected profit that new policy can obtain is at least (1 — O(e))0PT. 
Let us focus on the profit we collect from the main knapsack and ignore those from the auxiliary 
knapsack. We note that the main knapsack never overflows. Our algorithm deviates the policy a 
whenever some node u in the middle of some segment seg(u) realizes to a nonzero size, say s. In 
this case, a would visit u s , the s-child of u and follows T Us from then on, but our algorithm visits 
l(v) s , the s-child of the last node of that segment, and follows Th v y. The expected profit loss in 
each such event can be bounded by |P(T Us ) — P(Tu v \ s )\ < e 5 OPT. Suppose a pays such a profit loss, 
and switches to visit l(v) s . Hence, a and our algorithm always stay at the same node. Note that 
the number of edges corresponding to nonzero size realizations is at most C/e in any root-to- leaf 
path. So <7 pays at most 0(e -4 ) times in any realization. Therefore, the total profit loss is at most 
O(eOPT). 

Finally, we note that we may not be able to collect the profit from the main knapsack in every 
realization since the auxiliary knapsack may overflow, in which case we lose all the profit. An easy 
(but important) observation is that the decision of the policy is independent of the size realizations 
of the items in the auxiliary knapsack S. Therefore, we can think that the sizes of the items in S are 
realized after the execution of the algorithm. If S overflows, we lose the entire profit. Since there are 
at most C/e 4 items realizes to a nonzero size, S is packed with items from at most C/e 4 segments. 
By the first property of a seg ment, E[X(S)] < e 9 C. By Markov's inequality, Pr[X(5) > eC] < e 8 . 
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Hence, the expected profit we gain is at least (1 — e 8 )(l — O(e))0PT. 



□ 



5.2 Finding a Nearly Optimal Block- Adaptive Policy 

We need to modify the dynamic program to incorporate the constraint that at most one item 
from each Bj can be packed. Now, we use DP(i,C) = 1 to denote the fact that we can reach 
configuration C using items from {Bj., . . . , Bj}, such that on each realization path, at most one item 
in Bj can appear at most once. Suppose we want to compute the value of DP(i, C) for some i and 
configuration C. Suppose we have computed the values of DP(i — 1, C) for all C < C. We can 
decide to place items from Bj in a few blocks in the decision tree. The only constraint here is that 
we can place at most one item from Bj in each realization path. So the number of ways to do so is 
bounded by m/( e ) for /(e) = 2( 1 / eJ )° <1/E , which is a polynomial of the input size. So the overall 
running time of the dynamic program is still a polynomial. The rest of the analysis is the same as 
before and we do not repeat it here. 

In summary, we have the following theorem, from which Theorem 1 1 . 61 follows as a direct corollary. 

Theorem 5.1. For any e > 0, there is a polynomial time algorithm that finds a (1 + e)- approximate 
adaptive policy for GenSK when the capacity is relaxed to 1 + e. 

SK-CC with Exponential Number of Realizations We have assumed that the size distribution of 
each item has a polynomial size support. Now, we consider the case where the supports may be 
of exponential sizes (must be represented implicitly). The only assumption we make here is that 
for any item b% and time t, we can compute the signature of bj in polynomial time. The catch is 
that even though there are exponential even infinite realizations, the number of possible signatures 
is bounded by a polynomial. Moreover, as we increase t, each coordinate of the signature of b\ 
changes monotonically, thus the same signature does not appear again. Hence, starting from t = 0, 
we can use binary search to identify the first point where the signature changes. Repeating this, 
we can find all different signatures for b, each corresponding to b l for some t, in polynomial time. 
In the dynamic program, we only care the signature of an item, instead of an item per se. So we 
can let Bj contain only those b\s that correspond to distinct signatures. The size of Bj is therefore 
bounded by a polynomial. 

SK-CC without Relaxing the Capacity Combining this result and the algorithm developed in [12] 
we can give a (2 + e)-approximation for SK-Can (Theorem 1 1 . 7|) . This improves the factor 8 approx- 
imation algorithm developed in [35]. We note the algorithm in [12] does not work for correlated 
sizes and profits. So whether there is (2 + e)-approximation for SK-CC is still open. However, with 
mild assumptions on the size distributions, we can achieve an approximation factor of 2 for SK-CC 
(Theorem IC.ip . The details can be found in Appendix [Cl 

6 Bayesian Online Selection 

In this section, we consider the Bayesian online selection problem subject to a knapsack constraint 
(denoted as BOSP-KC). Our problem falls into the framework formulated in [32]. In BOSP-KC, we 
are given a set of items B = {b\, 62, ... , b n } and a knapsack capacity C. Each item 6j has a random 
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size Xi and a random profit Pj. Xi and Pj can be correlated but different items are independent of 
each other. The (discrete) joint distribution ir of Xi and Pj is the input to the problem, in the form 
of 7Ti(x,p) = Pr(Xi = x,Pi = p). Let T>{ be the support of 7Tj, i.e., T>% = {(x,p) \ TTi(x,p) ^ 0}. An 
adaptive policy a can choose an item 6, £ IB each time, view the size realization of Xi, and then make 
a irrevocable decision whether to pack bi or not. If a decides to pack bi into the knapsack (given 
the remaining capacity is sufficient), the profit Pj is collected. Otherwise, no profit is collected, 
the remaining capacity does not change and we can not recall bi later. If the items arrive in a 
predetermined order, we call the problem the fixed order BOSP-KC problem. 

Our problem is closely connected to the knapsack secretary problem [6] in the following sense. 
In the knapsack secretary problem, the size and the profit of each item are unknown in advance 
and the items arrive in a random order. When an item arrives, its size and profit become known 
to the decision maker and an irrevocable decision has to be made. Therefore, if we intentionally 
forget about the stochastic information and process the items in a random order, BOSP-KC be- 
comes exactly the knapsack secretary problem, for which a constant factor competitive algorithm 
is known [6] q The ability that we can adaptively choose the order of the items in BOSP-KC is 
very similar to the free order models of matroid prophet inequality |16j and matroid secretary |39j . 

In the remainder of this section, we focus on proving Theorem 11.81 First, we consider as a 
warmup an interesting special case where the profit of each item is a fixed value. In this case, we 
can show the following intuitive lemma. The proof can be found in the appendix. 

Lemma 6.1. Let a be the optimal policy. Suppose a chooses b as the next item to consider, and 
decides to discard b if Xb realizes to t. Then a should discard b if Xb realizes to a larger size s >t. 

The above lemma suggests that at a particular stage, for an item bi, there is a cutoff point t 
such that we accept b if Xi < t, and reject bi otherwise. In this case, item bi is equivalent to an 
item with the size X\ and profit P\, jointly distributed as follows: 



In fact, this viewpoint allows us to reduce BOSP-KC to GenSK. For each item bi, we create a set of 
items Mi = {bj}t where b\ represents the item bi with cutoff point t. The only requirement is that 
at most one item from Bj can be packed in the knapsack. Since we assume discrete distributions, 
there are at most a polynomial number cutoff points. Hence, the size of GenSK instance we create 
is also bounded by a polynomial. Theorem 11.81 directly follows from Theorem 15.11 

Now, we consider the general case where Xi and Pj are correlated. In this case, there is no single 
cutoff point as before. However, we can still reduce the problem to GenSK. Suppose we decide to 
consider bi at a particular stage, and decide to accept bi if and only if the realization (Xi,Pi) € D 
for some DCPj. We call D the acceptance set. Then, bi is equivalent to an item with the size X? 
and profit P t D , jointly distributed as follows: 



We note that most work on secretary problems and prophet inequalities measures the performance of the algo- 
rithm by comparing the solution found by the online algorithm against the offline optimum, while the approximation 
ratios in this paper are computed by comparing against the best adaptive policy. 




(Xi, Pi), Xi > t; 

(0,0), x,>t. 



{X?,P?) = [ 



{Xi,Pi), (Xi,Pi)ED; 
(0,0), Otherwise. 
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Figure 1: The staircase structure of an acceptance set D in an optimal policy. The solid points are 
in D and the crossed points are not. 

However, the GenSK instance created can be exponential in size since each subset D C T>i corre- 
sponds to a distinct item in Bj. We may reduce the size of B, by exploiting the simple observation 
that in any optimal policy, if (x,p) £ D, any (x',p') with x' < x and p' > p (a realization with a 
smaller size and a larger profit) must be in D (we can use the same proof as Lemma 16.11 to show 
this). But the resulting size is still exponential. 

Now, we sketch an algorithm that reduces the size of B, to a polynomial and incurs a profit loss 
of at most O(eOPT). We modify the distribution 7Tj as follows. 

1. For any (x,p) £ T>i such that x < — , we move the probability mass at point (x,p) to point 
(0,p). This step affects the total size of all packed items by at most eC since there are at 
most n items. 

2. For any (x,p) G T>i such that p < e0 J >T , we move the probability mass at point (x,p) to point 
(x,0). This step affects the total profit by at most eOPT. 

3. For other (x,p) € Vi, we move the probability mass at point (x,p) to point (x',p') where x' 
is the largest value of the form + e) k (for some k € Z + ) that is at most x, and p' is the 
largest value of the form £ °^ T (1 + e)' (for some / € Z + ) that is at most p. Obviously, this 
step affects the total size and total profit by at most a multiplicative factor 1 + e. 

Now, the number of different sizes is at most log 1+e n and the number of different profits is at 
most log 1+e "qpj x where p m ax is the maximum possible profit of any realization. We first assume 
Pmax is at most OPT. In this case, the size of T>i is at most logf , e n = 0(log 2 n) and the number 
of all subsets is 2°( log n ) = n°'' og "', which is slightly larger than a polynomial. To reduce this 
number to a polynomial, the simple observation we made before comes in handy. That is if an 
optimal policy accept a realization, it must accept any realization with a smaller size and a larger 
profit. Think all points in Bj are arranged as log 1+e n x log 1+e n grid points, in the 2d plane. If we 
include a point (x,p) in D (i.e., we accept the point), we need to include all points to the upper 
left of (x,p). Hence, the points in D form a staircase structure. The number of such structures are 
bounded by the number of monotone paths from the origin to the upper right corner of the grids, 
which is 



log 1+e n + log!. 



n 



log 



l+e 



J? 



< 



(21og 1+£ n) lo g 
(log 1+e n/e) lo e 



1 + 6 ' 



<(2e) 



poly(n). 
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We have reduced the size of Bj, thus the size of the GenSK instance, to a polynomial. 

Now, we consider the case where p max > OPT, Consider a particular item 6j. Suppose for some 
realization (x,p) S p > OPT. We call (x,p) a huge profit realization. A simple observation 
is that if a huge profit realization (x,p) is realized and x is no more than the remaining capacity 
C, we should accept (x,p). Moreover, we can not accept any (x,p) such that x > C. Hence, 
our search range is restricted in [0, C] x [0,OPT]. See Figure [TJ For each C', this reduces to the 
previous case. Since we only need to consider log 1+e n different C values, the number of different 
acceptance sets is clearly bounded by a polynomial. Therefore, by Theorem 15. 1\ we complete the 
proof of Theorem 11.81 

Fixed order BOSP-KC: For the fixed order model, the above reduction also works. The only change 
is that in the GenSK instance we reduce to, we are required to examine the items in a fixed order. 
Lemma 14.21 and Lemma 14.31 also hold in this case. So, it suffices to find a nearly optimal block- 
adaptive policy. We can also modify the dynamic program in Section 15.21 to find a block-adaptive 
policy subject to a particular order, as follows. Let DP(i, C) = 1 to denote whether we can reach 
configuration C using items from {Bi, . . . ,Bj}, exact one item from each Bj, such that the block 
where the item from Mj is placed should be no lower (w.r.t. the decision tree) than the block 
where the item from B& is place for any j < k. When we want to compute DP(i, C), we can only 
place items from Bj in the lowest blocks (w.r.t. the decision tree) with non-zero signatures (these 
blocks can be directly determined from C). The number of such placements is clearly bounded by 
a polynomial since the number of blocks is a constant. The rest is the same as in Section [5.21 

Finally, we note that we can also get a constant competitive algorithm when compared with 
the offline optimum, using simple LP techniques [16] 0. The algorithm can provide the information 
of OPT, up to a constant factor, that is needed in the discretization. 

7 Concluding Remarks 

We develop the Poisson approximation technique and successfully apply it in finding approximate 
solutions for a variety of stochastic combinatorial optimization problems. These problems range 
from fixed set optimization problems to adaptive online optimization problems, from problems 
with a single probabilistic objective function, to problems with many probabilistic constraints. 
Our technique is conceptually simple, easy to apply, and has led to simplifications, generalizations 
and/or improvements of several previous results. 

Our technique also seems quite flexible and could be potentially combined with other techniques 
to yield new results for other stochastic optimization problems. For example, we could first apply 
the technique in [LjJ to discretize the distributions into O(logn) equivalent classes. This could 
further reduce the number of possible signatures, which might be essential for problems exhibiting 
more complex combinatorial structures. 

In the realm of approximating the distributions of the sums of n random variables, the Poisson 
approximation theorem and its relatives [10] work most effectively in the regime where, roughly 
speaking, the sum of the expected values of those random variables stays constant as n increases. 
This is quite different from the Gaussian approximations (e.g., CLT, Chernoff bounds, Berry-Esseen 

7 We can call such result a knapsack prophet inequality. 
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type inequalities) which typically require the sum of the expected values increases as n. In fact, 
the theory of Poisson approximation is an important area in probability theory [10] , but it has not 
been explored and utilized in algorithmic applications as extensively. We believe our technique, 
and more generally the theory of Poisson approximation, can find wider applications in stochastic 
combinatorial optimization and other domains. 
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A Discretizing small size region 

More formally, let < d < e 4 be the value such that Pr[X 6 < d \ X b < e 4 ] • e 4 > E[X b \ X b < e 4 ] and 
Pr[Ab > d | X b < e 4 ] • e 4 > E[X b \ X b < e 4 ] (note that such d must exist). For ease of presentation, 
we can imagine the size value d being two distinguishable values, d\ = d — ^ and c?2 = d + ^, 
and we let d± < d < cfo. The probability mass Pr[Xfe = d] also consists of two parts PrLYfe = d\] 
and Pr[X b = cfo] and the value are so set that PrLYb < d\ \ X b < e 4 ] • e 4 = ~E[X b \ X b < e 4 ] and 
Pr^Q, > c?2 | X b < e 4 ] • e 4 = E[X b \ X b < e 4 ]. Therefore, in this new item, we can use value 
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d as the threshold value. Note that the effective profit pb(d) should also be divided accordingly: 
p 6 (di) = p b (d) Pr[X b = di]/ Pr[X b = d] and Pb (d 2 ) = p b (d) Pi[X b = d 2 ]/Pr[X b = d]. 



B Missing Proofs 

Lemma 12.11 Suppose each item b € B has a non-negative random weight X b taking values from 
[0,aC] for some constant a > 1. Then, for any 5CB and any ^ > e > 0, if E[[i(X(S))] > e, then 
E [X(5)] < 3a/e. 

Proof. Since X^s are independent, Var[X(S')] = ^2 b€S Var[Xj. As € [0, aC], we have 

Y&i[X b ] < E[X 5 2 ] < aC ■ E[X b ]. 

So, Var[X 5 ] < T, be s aC ' E M = « C ' E [ x ( 5 ')]- Suppose for contradiction that E\jj,(X(S))] > e 
and E[A A (S')] > 3a/e. Then, we can see that 

Vex[X(S)] > Pr[X(S) < C] • (E[X(S)} - C) 2 

> E[^X(S))} ■ (E[X(S)} - C) 2 

> e • (E[X(S)] - C) 2 

= e (E[X(5)] - 2C + C 2 /ELY(,S)]) E[X(S)\ 

> e(3a/e-2C + eC 2 /3)ELY(S)] 

> (3a - 2eC)E[X(S)] > aC • E[X(S)}, 

which contradicts the fact that Var[X(S)] < aCE[X(S)}. □ 
Lemma 12.21 Let S be a set of items such that E [X(S')] < 3/e. For any < j3 < C, we have that 

1. Pr[X(S) <f3}< Pr[X(S) < /3 + e] + O(e); 

2. Pr[X(S) < /3] < Pr[X(5) < /3 + e] + O(e). 

Proof. We prove the lemma for each step of discretization. To avoid using a lot of notations, when 
the context is clear, we always use X b to denote the size of b before a particular discretization step 
and X b to denote the size after that step. 

Step 1: Let 5 b = X b — X b . By our discretization, E[5 b ] = E[X b ] — E[X b ] = 0. Moreover, 

Var[4] = E[5 2 b ]-E 2 [8 b ] =E[5 2 b ] 

= Pi[X b < e 4 ] • E[(X b - X b ) 2 | X b < e 4 ] 

< E[(X b ) 2 | X b < e 4 ] + E[(X b ) 2 | X b < e 4 ] 

< e 4 (E[X b ]+E[X b }) <2e 4 E[X 5 ] 

Let 5(S) = ^2 b€ s$b- By linearity of expectation, E[5(S')] = 0. As A^s are independent, 
Var[(5(S')] = YlbeS^ aT i^] — 2e 4 E[X(S')] < 6e 3 . Therefore, the first inequality can be seen as 
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follows: 



Pr[X(S) < 0] 



Pr[X(S) < p A 5{S) < e] + Pr[X(5) < /3 A <f(S) > e] 
Pr[X(S) < /3 + e] + Pr[<J(5) > e] 
Pr[X(S) <P + e] + Yw[5{S)]/e 2 
Pr[X(S) < + e] + 6e 



< 



< 



< 



The proof for the second inequality is essentially the same and omitted here. 

Step 2: Noting that for step 2 we have X& < Xb < (1 + e)X&, the lemma is obviously true. 



Lemma 14.21 Let ir be the joint distribution of size and profit for items in B and ir be the 
discretized version of tt. Then, the following statements hold: 

1. For any policy a, there exists a canonical policy a such that 



Proof. For the first result, we first prove that there is a randomized canonical policy a r such that 
P(oy,7r, (1 + 4e)C) = (1 — 0(e))P(er,7r, C). Thus such a deterministic policy a exists. 

In the decision tree T{a, it, C), each edge e = (v,u) corresponds to an actual size realization of 
item v. We use w e to denote the discretized size of w e , i.e., w e = D v (w e ). 

The randomized policy a r is derived from a as follows. T (7r has the same tree structure as T a . 
If a r inserts an item b and observes a discretized size s € <S, it chooses a random branch in a among 
those sizes that are mapped to s, i.e., {w \ D^w) = s} according to the probability distribution 
Prfbranch w is chosen] = 7T& (it;) /7r&(s), where 7?6 is the discretized version of tti,. We can see that 
the probability of an edge in T ar is the same as that of the corresponding edge in T a . The only 
difference is two edges are labeled with different lengths (w e in T a and w e in T ar ). 

Now, we bound the profit we can collect from T ar with a knapsack capacity of (1 + 4e)C. Recall 
R(v) is the realization path from the root to v in T a . Let W(v) = X]eGfi(D) W.l.o.g., we assume 
that Vt> E T(a,TT, C), W(v) < C. By definition, we have that p v (s) = Yl e =(v u)\w e =sPv( w e)- By our 



This completes the proof of the lemma. 



□ 



P(cf, tt, (1 + 4e)C) = (1 - 0(e))P(<7, tt, C); 



2. For any canonical policy a 



P(a, tt, (1 + 4e)C) = (1 - 0(e))P(oF, tt, C). 
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construction, we have that 



P(<x r ,5r,(l + 4e)C) = ^ 

u,s|W(u)+s<(l+4e)C 
e={v,u)\W(v)+w e <(l+4e)C 

Hv)p v (w e ) 

e=(v,u)\W (u)<(l+4e)C 

= P(<7,7T,C)- Yl Hv)Pv(We) 

e=(v,u)\W(u)>(l+4e)C 



> P(ct,tt,C) 



i - £ 



where 5i = {v e r(a,7r,C) | W(u) < (1 + 4e)C and 3e = (v,u),W(u) > (1 + 4e)C}. The last 
inequality holds due to Property PI. We upper bound YlveSi ^( v ) m the following lemma. 

Lemma Let a be a policy and let (ir, C) be an instance of the stochastic knapsack problem. Let S 
be a set of nodes in T(a, ir, C) that contains at most one node from each root-leaf path. Then we 
have that 

E *(«) = o( e ). 

ueS||W(«)-W(«)|>e(C+l) 

Proof. We can assume w.l.o.g. that S 1 is the set of leaves in T a . We prove this lemma for both 
steps of the discretization in Section [2.11 First, we show after Step 1: it holds that 

(a) E veS \\W(v)-W(v)\>e®( V )=°( e )- 

For u (£ T a , let b u be the item corresponding to u and define 5 U = X[, u — X bu . As we have shown 
in Lemma E[«5 U ] = and Var[<f u ] < 2e 4 ELY fc J. 

Let R a be the random path (consisting of both nodes and edges) a would choose. For u € T a , 
let R u + be the random path a would choose after reaching u (including u and edges incident on 
u). Define 8(R U+ ) = ~Yl< e( zR u+ {w e — w e ). Since a packs an item b before it realizes to a particular 
size, we have that: E[5(i? u+ )] = J2 U '£T U ^M " chooses u' \ a choosesii]E[£ u '] = 0. Moreover, 

Var [6(R U+ )} = E [(5(R U+ )) 2 } - (E[5(R U+ )}) 2 = E [(5(R U+ )) 2 ] 

= Y ■ E [K - W e + 5{R u > + )f] 

e=(u,u') 

= J2 ^e{(w e -We) 2 +0 + E[(5(R u , + )) 2 }) 
e=(u,u') 

< Var[5 M ] + max Vax[8(R u >+)] 

e=(u,u') 

< 2e 4 E[X 6 J+ max Vax[8(R uf+ )\ 

e=(u,u') 
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By Property P2, ^ u g_R(t>) ^[-^"f>J = 0(€-/e) for any root-leaf path R(y). Then we have Va,v[5(R a )] = 
0(e 3 ) by induction. By Chebychev's inequality, we get that 

veS\\W(v)-W(v)\>e 



Step 2: For step 2, \w e — w e \ < ew e and W(v) < C for any e,v. Thus we have that: 



(b) Vv E S, W(v) - W(v] 



< eC. 



From (a) and (b), we can conclude the lemma. □ 

For any edge e = (v, u) € T(a, it, C), we have W(u) < C and w e — w e < e 4 . Thus for any v G Si, 
we have 

W(v) - W{v) > max { W(u) - W{u) - e 4 } > (1 + 4e)C - C - e 4 > e(C + 1). 

Therefore, Ei,eSi = ^eSi|W(i;)-VK(«)>e(C+l) Note that ^ contains at most one node 

from each root-leaf path. Applying the above lemma again, we have that X^eSi ^( v ) = 0{e). This 
completes the proof of the first part. 

Now, we prove the second part. $(u)s are defined with respect to T(<7,7r,C). Since a canonical 
policy makes decisions based on the discretized sizes, T(a,Tr, C) has the same tree structure as 
T((j, 7r, (1 + 4e)C), except that we can not collect the profit from the later if the knapsack overflows 
at the end of the policy. More precisely, we have that 

P(5,vr,(l + 4e)C) = ^ <S>(v) Pv (w e ). 

e=(v,u)\W(u)<(l+4e)C 

where e € T(cr,7r,C). W.l.o.g., we assume that W(v) < C holds for all v 6 T(<7,7r,C). Thus 

P(cr, 7T, (1 + 4e)C) > P(3F, Sr, C) 1 - ^ $(t 

where 5 2 = {v G T(5,7r,C) | W(t;) < (l + 4e)C and 3e = (v,u),W(u) > (l + 4e)C}. By Lemma ??, 
we have X^es 2 ^( v ) = 0(e). This completes the proof of the lemma. □ 

Lemma 16.11 Let a be the optimal policy. Suppose a chooses b as the next item to consider, and 
decides to discard b if Xb realizes to t. Then a should discard b if X^ realizes to a larger size s > t. 

Proof. We use P(<r, B,C) to denote the expected profit the policy a can achieve with item set B 
and remaining capacity C. Since a discards b when Xb realizes to t, we have that P(<r, B\{6}, C) > 
P b + P(<r, B\{6}, C - 1). We also have P(<r, B\{6}, C'-t)> F(a, M\{b},C ~ s) since C'-t>C-s. 
Thus, P((r,B\{6},C) > P b + P(cr,B\{6},C - s). Therefore, a should discard b when X b = s > t, 
in order to maximize the expected profit. □ 
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C SK-CC Without Relaxing the Capacity 



We can slightly modify the algorithm developed in [12J to give a (2 + e)-approximation for SK- 
Can. We only state the modification. For each item 6j, let Bj = {&*} be the set of items where b\ 
corresponds to 6, canceled at size t. We say b\ is a small profit item if Pj ■ Pr[Xj < t] is less than 
eOPT, where Pj is the (fixed) profit of 6j and Xi is the size of 6j. Theorem 4 in |12| also holds for 
SK-Can. The only modification is how we bound the profit loss Ap in the two knapsack experiment. 
Let £(W, C) denote the event that b\ is placed into the first knapsack when the remaining capacity 
is C. Since the items realizes to a particular size after it is placed, the loss of expected profit is 

A P = ^Pr [£{b\, C)] ■Pr[C<X i <t] • P*. 

b\,c 

Since Pr[C < Xi < t] = Pr[X; > C] • Pr[Xj < t \ X t > C] < Pr[X, > C] • Pr[Xj < t], and 
Pr[Xj < t] ■ Pi < eOPT for small profit items, we have that 

A P < Pr[^(&i, C)] • Pr [^i > C] ' eOPT. 

Since ^ fet c Pr C)l ■ Pr[X > C] is the overflow probability, which is at most 1, we have 
Ap < eOPT. 

Moreover, the (l + e)-approximation algorithm (Lemma 8.1 in [24J) for finding an optimal policy 
that packs a constant number of items can be easily extended to SK-Can. Therefore, Theorem II .71 
follows. 

In SK-CC, if we assume that, for each item, the difference between the maximum and the 
minimum possible sizes is bounded by C — 5 for any constant 5 > 0, we can still obtain a factor 
2 approximation algorithm using the algorithm in [T2]. A careful examination of the proof shows 
that there is no profit loss in the two knapsack experiment. So all results in [12J continue to hold. 

Theorem C.l. There is a polynomial time algorithm that finds a 2-approximate adaptive policy 
for SK-CC if for each item, the difference between the maximum and the minimum possible sizes is 
bounded by C — 5 for any constant 5 > 0. 

D FPTAS for SK-U 

In this subsection, we give a linear time FPTAS for the stochastic knapsack problem where each 
item has unlimited number of copies (denoted as SK-U). SK-U is a classic Markov decision process 
(MDP) with continuous states has been studied extensively [251 ISS H] • Optimal adaptive policies 
(in this case, the decision of inserting which item only depends on the remaining capacity) have 
been characterized for some special distributions, e.g., exponential distributions |25j . We would 
like to note that, for continuous distributions with Lipschitz PDFs, we can use the general results 
on discretizing continuous MDP to get a PTAS for SK-U (see e.g., [56]). 

In this problem, we first apply the discretization in Section 12.11 In SK-U, a policy can also 
be represented as a function f a : [0, C] — > B. Since all discretized sizes are multiplies of e 5 , it is 
possible to find out the optimal canonical policy f a (s) by dynamic programming as follows. 
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We use DP(s) to denote that the expected profit of the optimal policy a on (ip,s). Initially, 



which is the expected profit of repeatedly packing copies of b until one copy realizes to a nonzero 
size. We compute all DP(s) values for s = S& in increasing order of k = 0, 1, . . . , <S — 1. Suppose we 
choose item b when the remaining capacity is s. Then DP(s) = J2k=o\P b ( Sk ) + 7r b( s fe)DP(s — Sfc)]. 
Therefore, the recursion of the dynamic program is the following: 



Each DP(s) can be computed with in 0(n ■ C/e 5 ) time, and there are 0(C/e 5 ) different values of s. 
Thus the running time of the dynamic program is 0(n ■ C 2 /e 10 ) = Oin). By applying the fast zero 
delay convolution in [22J, we can speed up the dynamic program and further reduce the running 
time to 0(n • C/e 5 log 2 (C/e 5 )). 

Since the policy can be represented as a decision tree, Lemma 14.21 still holds in this case. 
Theorem ID . 1 1 can be proved following the same argument as SK. 

Theorem D.l. For any e > 0, there is a linear time FPTAS for SK-U when the knapsack capacity 
is relaxed to C + e. 




oo 




k=l 
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